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(57) Abstract 

The present invention provides substantially purified cryptdin peptides and nucleic acid molecules encoding cryptdin peptides. The 
invention further provides methods for detecting inflammatory pathologies in a "subject and methods for treating an inflammatory pathology 
in a subject by administering a pharmaceutical composition containing a cryptdin peptide. Representative cryptdin peptides arc presented 
in the figure. 
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ANTIBIOTIC CRYPTDIN PEPTIDES AND METHODS OF THEIR USE 
This application is a continuation-in-part of 
United States Serial No . 07/930 , 649 , filed August 14, 
1992, which is a continuation-in-part of U.S. Serial No., 
5 07/889,020,. filed May 26, 1992, each of which is 
incorporated herein by reference. 

This invention was made with government support 
under g;rant numbers AI22931, AI31696, DK08851, DK44632 
and DK33506, awarded by National Institutes of Health. 
10 The Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 
This invention relates generally to 
15 antimicrobial peptides and more specifically to cryptdin 
peptides, nucleic acid molecules encoding cryptdins, and 
their uses. 

BACKGROUND INFORMATION 

20 Survival in a world teaming with microorganisms 

depends on a network of host defense mechanisms. Among 
these mechanisms are phagocytosis by cells are resident 
in tissues or that circulate in the blood system and 
ingest, kill and digest potentially harmful microbes. 

25 Although pathogenic microbes may vary considerably, 
phagocytes are able tp destroy the vast majority by 
sequestering them in intracytoplasmic vacuoles and 
exposing them to a lethal . mixture, of organic and 
inorganic toxins . 

30 Perhaps the most remarkable ultrastructural 

feature of phagocytes are their several thousand, 
cytoplasmic granules, which are membrane -bound organelles 
typically about 0 . 3 /im in diameter. During phagocytosis, 
some of these granules fuse to phagocytic vesicles thus 
-35 enabling the contents of the granule to enter the lumen 
of the vesicle.' Early observers surmised correctly that 
the granules contained factors which were responsible for 
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intraphagosomal killing in digestion of microbes. These 
granules contain a mixture of antimicrobial molecules 
including various peptides such as the so-called 
defensins. 

5 Defensins are abundant antimicrobial peptide 

components of vertebrate neutrophil and macrophage 
granules. Members of the defensin family have been 
identified previously in human, rabbit, guinea pig and. 
rat phagocytes, primarily those phagocytes termed 

10 phagocytic granulocytes. Defensins are cat'ionic peptides- 
that have molecular weights between about 3 and 4 
kiloDaltons ' (kDa) and that exhibit broad-range 
antimicrobial activities against gram negative - and gram 
positive bacteria, many fungi and some enveloped viruses. 

15 . The peptides are characterized by eight invariant amino 
acids, including six invariant cysteine residues that 
. constitute a unique disulfide motif. The three disulfide 
bonds stabilize' a tertiary conformation consisting 
predominantly of G-sheet. The highly ordered structure 

20- and the absence of a helix make defensins unique among 
known antimicrobial peptides. It appears that defensins 
exert their antibacterial effect by permeabilizing the 
cytoplasmic membrane of the target microorganism ' by a 
mechanism that may involve the formation of ion channels 

25 or transmembrane pores. 

Until recently, defensins had been identified 
only in circulating or tissue phagocytes of myeloid 
origin. However, based on the presence of a particular 
mRNA, it has been surmised that similar peptides might be 

30 present in the epithelial cells of the small intestine. 
Such intestinal - peptides may prevent access of 
microorganisms through the small intestine into the 
systemic circulation and, therefore, can be useful as a 
therapeutic or prophylactic agent. Thus, a need exists 

35 to identify peptides that have antimicrobial activity 
within the mucosal epithelium or in the intestinal lumen. 
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The present invention satisfies this need- and provides 
additional benefits as well. 

SUMMARY OF THE INVENTION 

~ The present invention provides a substantially 

purified cryptdin peptide having a consensus amino acid 
sequence: 

X l -C-X 2 -C-R-X 3 -C-X 4 -E-X s -C-X s -C-C-X 7 

wherein X : is 3 to 9 amino acids; X 2 is 1 amino acid, 
preferably Y., H or R; X 3 is 2 or 3 amino acids; X« is 3 
amino acids; X s is 5 amino acids; X 6 is 6 to 10 amino 
acids; and X 7 is 0 to 9 amino acids. 

The invention also provides a substantially 
purified mouse cryptdin having a consensus amino acid 
sequence: 

X l -L-X 2 -C-Y-C-R-X 3 -C-K-X,-E-X s -G-T-C-X 6 -C-C-X 7 

20 

wherein X l is 3 or 4 amino acids, preferably LRD, LSKK 
(SEQ ID NO: 8) or LRG; 

X 2 is 1 amino acid, preferably V, L or I ; 
X 3 is 3 amino acids, preferably KGH or *RG, 
25 where * is S ( T, K ( I or A; 

X 4 is 2 amino acids, preferably GR, RR or RG; 
X s is 3 amino acids, preferably RMN, RVR , RVF 
HMN or HIN; 

X $ is 6 to 9 amino acids, preferably GIRFLY (SEQ 
30 ID NO: 3) or RNLFLTFVF (SEQ ID NO: 4), RRGHLMYTL (SEQ ID 
NO: 59) or RKGHL* YT* (SEQ ID NO: 5), where 
independently is L or M; and 

X 7 is 0 to 3. amino acids, preferably R, S or 

PRR. 

35 For example, the invention provides various 

mouse, rat or human cryptdins having the sequence:. 



8NSDC-: <wo geisorjAij^ 



10 



15 



WO 96/16075 



PCTAJS95/13328 



1) LRDLVCYCRSRGCKGRERMNGTCRKGHLLYTLCCR (SEQ ID NO: 9) ; 

2) LRDLVCYCRTRGCKRRERMNGTCRKGHLMYTLCCR (SEQ ID NO: 10); 

3) . LRDLVCYCRKRGCKRRERMNGTCRKGHLMYTLCCR (SEQ' ID NO : 11); 

4) . GLLGYCRKGHCKRGERVRGTCGIRFLYCCPR (SEQ ID NO: 12); . 

5 5) LSKKLICYCRIRGCKRRERVFGTCRNLFLTFVFCC (SEQ ID NO: 13);. 

6) LKQCHCRKFCRPYEKAEGSCRPGLFIKRKICCIQQWTPG (SEQ ID ■ 

NO: 14) ; 

7) GLLCYCRKGHCKRGSRVRGTCG I RFL YCCPRR (SEQ ID NO: 15); 

■8) LSKKLICYCRIRGCKRRERVFGTCRNLFLTFVFCCS (SEQ ID NO: 16); 
10 9) LRDLVCYCRARGCKGRERMNGTCRKGHLLYMLCCR (SEQ ID NO: 17); 
10) LKQCHCRKFCRPYEKAEGSCRPGLFIKRKICCIQQWTPGRT (SEQ ID 
NO: 18) ; 

' 11) I GRP VRRCRCRANCG P KE YATAFCAQG P F KQ FKFCCT (SEQ ID. 
NO: 19) ; 

15 12) IRWPWKRCHCRSFCRPYENATSFCAQGLFKQHKFCCLDTWPPRMK (SEQ 
ID NO: 20) ; 

13) TS GS Q ARATC YCRTGRCATRE SLSGVCE I SG RLYRLCCR TsEQ ID 

NO: 21) ;. and 

14) AFTCHCRRSCYSTEYSYGTCTVMGINHRFCCL (SEQ ID NO: 22) . 

20 Cryptdins are typically characterized by being 

naturally ■ found in the epithelial cells of the small 
intestine, being cationic, being about 30 to about 45 
amino acids in length, having at least, -three and, 
preferably, three to nine, amino acids to the N-terminal 

25 of the first cysteine residue, -exhibiting specific 
antimicrobial activity against .intestinal pathogens and 
opportunistic pathogens and being relatively non-toxic to 
cells of the host, organism. "However, there may be 
diversity in these structural . and functional 

30 characteristics. The invention also provides cryptdin 
. analogs, which are devoid of one or more amino acids N- 
terminal to. the first cysteine. In addition, 'the 
invention also provides nucleic acid molecules encoding 
cryptdin peptides'. For example, the invention provides 

3 5 genomic DNA sequences and cDNA sequences encoding mouse 
and rat cryptdins. 
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The invention further provides a method for 
detecting an inflammatory pathology in a subject by 
determining the amount of cryptdin in' a biological sample 
from the subject and comparing that amount to the amount 
5 present in a normal subject. Such a method can be used 
to determine the presence of an inflammatory pathology 
such . as inflammatory bowel disease, pancreatitis, 
malignancy, infection or ileitis. 

The invention also provides a method for 

10 treating an inflammatory pathology in a subject by 
administering a cryptdin to the subject.' Such treatment 
is particularly advantageous in patients who are 
immunocompromised due, for example, to malnutrition, 
radiation, burns , immunosuppressive infections, autoimmune 

15 disease, neonatality, bone marrow transplantation or 
chemotherapy. A cryptdin can be administered orally, by 
nasogastric intubation, by transabdominal catheter, 
intravenously or by aerosol inhalation. When 
administered orally, it is preferably in a delayed 

20 release formulation designed to permit release in the 
small intestine. The cryptdin can be administered as a 
composition with a physiologically acceptable medium, and 
more than one cryptdin can be administered simultaneously, 
or sequentially. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides the structures of mouse 
cryptdins 1-5' (SEQ ID NOS : 9 to 13, respectively) and rat 
cryptdin 1 (SEQ ID NO: 14) , Amino acid residues are 
.30 indicated by single letter code. Dashed lines are 
included in . mouse .cryptdin 4 (SEQ ID NO: 12) and rat 
cryptdin 1 (SEQ ID NO: 14) in order to preserve the 
consensus sequence where these peptides are shorter than 
• other cryptdins. Invariant . residues in the enteric 
35 cryptdin peptides are boxed. Disulfide bonding motifs 
are depicted by connecting double lines. 
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Figures 2. A. to 2.C. show chromatograms 
representing the purification of enteric cryptdins. Acid 
extract of jejunum and ileum was chromatographed in 30% 
acetic acid on a P-60 column. Fractions indicated by. the 
5 bracket (Figure 2.A.). were pooled and rechromatographed * 
on the P-60 column (Figure. 2.B.). .Cryptdin containing 
fractions (bracket,' panel B) were pooled and further 
.purified by reversed-phase high performance' liquid 
chromatography (RP-HPLC) on 0.4 6 x 2 5 cm Vydac C-18 

10 column. Water-acetonitrile gradient elution (--). using 
0.13% (vol/vol) HFBA as modifier was used to" purify 
cryptdins , 1-5 (SEQ ID NOS : 9 to 13, respectively). The 
brackets in Figure 2.C. indicate the peptide contained in 
each peak, and the portion of each which was subjected to 

15 a second round of RP-HPLC . . " 

Figure 3 shows acid-urea PAGE of purified 
enteric cryptdins. Samples of low molecular weight 
enteric peptides obtained by P-60 gel filtration (Figure 
2, panel B) and purified cryptdins were electrophoresed 

20 on a 12.5% .acid-urea gel and stained with formalin- 
containing Coomassie Blue. Lane A: approximately. 20 jj.g 
P-60 low molecular weight peptide fractions; lanes B-F: 
1 Mg each of cryptdins 1-5 (SEQ ID NOS: 9 to 13), 
respectively. 

25 Figures 4. A. and 4.B. compare mouse cryptdins 

1-5 (SEQ. ID . NOS: 9 to 13, respectively) and partially 

purified luminal peptides. 

Figure 4. A. Lyophilized luminal lavage of 

small intestine from 12 mice and 20 /ig. protein was' 
30 fractionated by P-60 gel filtra.tion and electrophoresed 

on an acid-urea acrylamide gel (lane 2) along side a 
.similarly prepared sample of bowel tissue (lane 1) . The 

positions of cryptdins 1-5 (SEQ ID NOS : 9 to 13, 

respectively), are indicated. 
35 Figure 4.B. Partially purified luminal 

peptides (20 \xq ; as for Figure 4. A., lane • 2 ) were 

electrophoresed in a second, acid-urea gel (lane 3) along 
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• with an identical sample previously treated with 
performic acid (lane 4). In lane 4, rapidly migrating, 
cyst (e) ine-containing peptides are absent due to the 
increased net negative charge- resulting from the 
5 conversion of cyst(e)ines to. cysteic acid residues. 

Figure 5 shows the identification of mouse 
cryptdins 1-5 (SEQ ID NOS : 9 to 13 , respectively) in 
small intestine epithelium. Acid extracts of intact, 
whole small intestine (W) or epithelial sheets (E) were 

10 lyoph-ilized, dissolved in sample solution and resolved on 
a 12.5% acid-urea acryiamide gel. Cryptdins 1-5- (SEQ ID 
NOS: 9 to 13) are identified numerically. 

Figures 6. A. to 6.F. show. the 
immunohistochemical localization of cryptdin 1 (SEQ ID 

15 NO: 9) in small intestine. Full thickness sections of 
adult mouse jejunem were incubated with preimmune 
(Figures 6. A., 6.C. and 6.E.) or ant i -cryptdin C rabbit 
IgG (Figures 6.B., 6.D. and 6.F.) and developed using 
peroxidase anti-peroxidase secondary. 

20 antibody magnifications: Figures 6. A. and 6.B., 40X; 
•Figures 6 .C. and 6 .D. , 2 5 OX; Figures 6 . E . and 6 . F . , '64 OX. 

Figures 7. A. and'7.B. depict the antimicrobial 
activity of mouse cryptdin 1 (SEQ ID NO: 9).. Samples of 
purified natural mouse cryptdin 1 (Figure 7. A.) or rabbit 

25 NP-1 (Figure 7.B.) were dissolved in 0.01% acetic acid 
and pipetted into wells produced in a 0.6% agarose/0.3% ■ 
tryptone plate . containing 1 X 10 s log phase bacterial 
cells.' After incubation at 37°C for 18 hr, antimicrobial 
activity was evaluated by measuring the diameters of the 

30 clear zones.- Closed circles denote wild type 

S. typhimurium.; open circles denote the phoP* mutant. 

Figure 8 shows the amino acid , sequences for rat 
cryptdins 1-3 (SEQ ID NOS: 18-20, respectively), human . 
cryptdins 5 and 6 (SEQ- ID NOS : 21 and 22; HD-5 and HD-6) 

35 and a consensus sequence (Def consensus) . Also shown are 
the amino acids sequences for rat prepro-cryptdins 1-3 
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(SEQ ID NOS: 60-62) as deduced from cDNA or genomic DNA 
sequences as . indicated . 

Figures 9. A. and 9.B. show the amino acid 
sequences of mouse cryptdins 1-17 {SEQ ID NOS: 9-11, 15- 
5 17 and 23 -33, respectively) as determined from the cDNA 
sequences encoding the specific cryptdin. 

Figure 9. A. shows the entire amino acid 
sequence of the mouse cryptdins. The amino acid 
• sequences of cryptdins 1-6' (SEQ ID NOS: 9-11 and 15-17) 
10 • were determined by sequencing the purified peptides. The- 
amino acid sequences of cryptdins 7-17 (SEQ ID NOS: 23-33) 
were deduced from the cDNA sequences (see Figure 10) . 
The amino acids encoded by Exon 1, which encodes the 
signal peptide and propiece, and Exon 2, which encodes 
15 the mature cryptdin peptide, are indicated. A dot 
indicates the sequence was not encoded by the cDNA clone. 

indicates a space, which preserves the homology of 
the sequences. 

Figure. 9.B. indicates the degree of relatedness 
20 of the mouse cryptdins. Amino acids that are identical 
to the amino acid shown for cryptdin 1 (SEQ ID NO: 9) are 
indicated by a dot. 

Figure 10 shows the nucleic acid sequences for 
the cDNA sequences encoding mouse cryptdins 1-17 (SEQ ID 
25 NOS: 34-50, respectively). A consensus nucleotide 
sequence . also is shown (SEQ ID NO: 51) . A dot indicates 
the nucleotide is the same as shown for cryptdin 1. The 
amino acid sequence for cryptdin 1 .(SEQ ID. NO: 9) is 
shown above the nucleic acid sequence. Numbers below the 
30 nucleotide sequence indicate the nucleotide position 
relative to the methionine start codon (+1). Numbers 
above the amino acid sequence indicate the amino acid 
position. Italics indicate the mature cryptdin peptide 
. sequence. Nucleotides in lower case letters indicate 
35" non-coding sequences. «#**•• indicates a stop codon. 
n (A) n " indicates poly-A tail. " * " indicates a space and 



8NS0O0ID: <WO 9616075AI J_» 



WO 96/16075 



PCT/US95/ 13328 



9 

indicates the particular nucleotide could not be 
determined unambiguously. 

Figure 11 shows the genomic . DNA sequences for 
mouse cryptdins 1, 2, 3, 5 and 6 (SEQ ID NOS : 53 -57, 
5 respectively) and the genomic sequence for the apparently 
inactivated mouse cryptdin i. gene (Crypi; SEQ ID NO: 58) , 
in which a stop codon (TGA) is substituted for a cysteine 
residue. Numbering is as described in the legend to 
Figure 11. The upper sequence represents a consensus 

10 cryptdin gene sequence (SEQ ID NO: 52). "X" indicates 
positions at which at least two sequences containing 
nucleotide changes. The TATAAA box is shown in lowercase 
italics; exons are shown in capital letters; "**" 
indicates intron DNA; "n" represents approximately 500 

15 base pairs that were not sequenced. "@" indicates the 
start of the : cryptdin peptide coding region at codon 59. 
Coding sequences are indicated in bold print. Prepro- 
regions are coded by nucleotides 1-172; cryptdin peptides 
are coded by nucleotidase . 173 -279 . The stop codon is 

•20- underlined. 

GenBank . accession numbers for these sequences 

are 002994 (cryptdin 1, exon 1); 002995 (cryptdin -1, exon 

2); 002996 (cryptdin 2, exon 1); 002997 (cryptdin 2, exon 
2); 002998 (cryptdin 3, exon 1); 002999 (cryptdin' 3, exon 

25 2); 003000 (cryptdin 5, exon 1) ; 003001 (cryptdin 5, exon 
2); 003002 (cryptdin 6, exon 1) ; 003003 (cryptdin 6, exon 
2); 00.3004 (cryptdin i, exon 1); and 003005 (cryptdin i, 
exon 2 ) . 

Figures 12. A. to 12. C. demonstrate the 
30 effectiveness of mouse cryptdins (as indicated) in 
inhibiting the growth of £. coli ML35 cells in an agar 
diffusion assay. 

Figures 13 .A. to 13. C. demonstrate the 
effectiveness of mouse cryptdins (as indicated) in 
35 killing E. coli cells in suspension. 

Figures 14 .A. to 14. C. show the cDNA sequences 
encoding .rat cryptdin l (Figure 14. A.), rat cryptdin 2 
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(Figure 14. B.) and rat cryptdin 3 (Figure 14.C). 
Nucleotide numbers are indicated. 

Figures 15. A. to 15. C. show the genomic DNA. 
sequences encoding rat cryptdin 1 (Figure .15.. A. } rat 
5 . cryptdin 2 (Figure 15. B.) and rat cryptdin 3 (Figure 
-15.C). Nucleotide numbers are indicated. 

DETAILED DESCRIPTION OF THE INVENTION 
The invention provides small peptide molecules, 
10 termed cryptdins, which express a broad range of 
antimicrobial activity, particularly. against intestinal 
pathogens, and for this reason are useful ' antimicrobial 
agents. For example, cryptdins have antimicrobial 
activity against gram negative and gram positive bacteria 
15 and against protozoan pathogens (see Example III) . 
Cryptdin peptides and nucleic acid sequences encoding 
cryptdins were isolated from the small intestine "and are 
active within the. epithelial lining of the small 
intestine and within the lumen of the intestine. Because 
20 it is indicative of .inflammatory processes, the presence 
of cryptdins can be utilized in the diagnosis of a wide 

range of inflammatory conditions. 

As used herein, the term •'cryptdin" or "enteric 
defensins" refers to peptides having generally .between 
25 about 30 and 45 amino acids. Cryptdins are 

characterized, in . part, by a consensus sequence 
containing six cysteine residues. Illustrative sequences 
are provided in Figure 1, which shows invariant residues 
and the disulfide bonding motif. In. addition/ those 
30 residues which are preferably invariant are identified 
(see, also, Figures 8 and 9) . 

Cryptdins are further characterized by their 
cationic charge and their broad range of antimicrobial 
activity. While related to leukocyte -derived defensins, 
35 cryptdins are distinguished from these other molecules by 
the presence of 3 to 9 amino acids N-terminal to the 
first cysteine molecule. Cryptdins may have C-terminal 
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extensions as well. In . addition, they exhibit 

antimicrobial activity against enteric microorganisms, 
which can become blood-borne pathogens if the intestinal 
barrier is breached. Since cryptdins are associated with 
5 the secretory granules of Paneth cells in the small 
intestine, they can be secreted from the cells in which 
they are produced (Satoh, Cell Tiss . Res. 251:87-93 
(198 8) ; Satoh et al., Acta Histochem. 8 3 : 18 5-188 ( 1988) ) . 
Unlike leukocyte -derived defensins, cryptdins are not 

10 toxic to mammalian cells. 

It ■ should be appreciated that various 
modifications can be made to the cryptdin amino ' acid 
sequence without diminishing the antimicrobial activity 
of the peptide. It is intended that peptides exhibiting 

15 such modifications, including amino acid additions, 
. deletions or substitutions are within the meaning, of the 
term "cryptdin" and, therefore, within the scope of the 
invention. For example, cryptdin analogs, which, are 
devoid of one or more amino acids N-terminal to the first 

20 cysteine residue, are within the present invention. Such 
cryptdin analogs can be synthesized using well known 
methods (see Example VI) or - can be purified from the 
intestine where they may occur naturally due, for 
example, to partial proteolysis of a cryptdin peptide in 

25 the intestinal lumen. 

Use of the phrase "substantially pure" in the 
present specification and claims as a modifier of 
peptide, protein or nucleic acid means that the peptide, 
• protein or nucleic acid so designated has been separated 

30 from its in vivo cellular environment. As a result of 
the separation and purification, the substantially pure 
peptides, proteins and nucleic acids are useful in ways 
that the non-separated impure peptides, proteins and 
nucleic acids are not." 

35 The cryptdin peptides of the present invention 

preferably contain between about 30 and 45 amino acids 
(see Figures 1, 8 and 9). Cryptdins can be synthesized 
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by methods well known in the art, such as through the use 
of automatic peptide synthesizers or by well-known manual 
methods of- peptide synthesis (see Example VI). i n 
addition, they can be purified from natural sources .such 
5 as- small intestinal' epithelium of vertebrate, .preferably 
mammalian, origin (see Example I) . Such epithelium can 
be obtained, for example, from rats, mice or humans using 
means well known to those skilled in the art. 

As disclosed herein, various cryptdin peptides 

10 were isolated from intestinal epithelium, purified by 
chromatographic - methods and characterized by 
electrophoresis and amino acid sequencing. Crypt din's 
were identified by their rapid migration on acid-urea 
PAGE and by their apparent molecular weight of about 

15 4 kDa (see Examples I and II). 

Anti -cryptdin antibodies were made using 
methods conventional in the art. For example, polyclonal 
antiserum can raised in appropriate animals, such as 
rabbits, mice or rats. Cryptdin peptides, either 

20 synthetic or obtained from natural sources, can be used 
to immunize the animal. As described in Example IV, a' 
cryptdin analog, cryptdin C, which corresponds to 
residues 4-35 of mouse cryptdin 1 (SEQ* ID NO: 9) as shown 
in Figure 1, was used to immunize rabbits using well 

25 known methods. Serum samples were collected until the 
ant i -cryptdin titer was appropriate. Various fractions 
of the antiserum, such as IgG, can be isolated by means 
well known" in the art. Cryptdin immunogens also can be 
used to obtain monoclonal antibodies using methods well 

30 known in the art (see, for example, Harlow and Lane, 
• Antibodies: A Laboratory- Manual (Cold Spring Harbor. 
Laboratory Press 1988) , which is incorporated herein by 
reference) . 

The antimicrobial activity of a cryptdin can be 
35 measured against various pathogens. As disclosed in 
Example III,, various microorganisms were grown, to an 
appropriate concentration, mixed with an appropriate 
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medium such as an agarose- trypticase soy medium and 
contacted with a cryptdin. Antimicrobial activity was 
apparent, for example, from the clear zones that 
surrounded the cryptdins in an agar diffusion assay. The 
5 area of the clear zones was concentration dependent (see . 
Figure 12) . 

Anti-cryptdin antibodies, can be used to 
determine the presence of cryptdin in a. biological sample 
such as a histological sample. For example, sections of 

10 small intestine are fixed by means well known to those 
skilled in the art and incubated with anti-cryptdin 
" antibodies such as an IgG fraction of antiserum, if 
desired, the anti-cryptdin antibody can be detectably 
labelled or an appropriate detectable second antibody can 

15 be used to identify the presence of the primary antibody 
attached to the cryptdin. Means of detection include the 
use of radioactive protein A or enzyme substrates such, as 
peroxidase (see Harlow and Lane, supra , 1988) . 

Alternative methods of determining the presence 

20 of cryptdin in a biological sample . obtained, for example, 
by intestinal lavage or by disrupting cells or tissues 
can be useful to determine the presence of inflammatory 
processes. In the presence of inflammatory processes, 
the concentration of cryptdins is significantly altered 

25 from that found in the normal cell. ..In particular, a 
deviation from the normal level of cryptdins by one to 
two standard deviations is indicative of an inflammatory 
process. Such an inflammatory process can include, for 
example, inflammatory bowel disease, pancreatitis, 

30 malignancy, infection or ileitis. 

Because of their broad range of antimicrobial 
activity and their ability to function within the 
intestinal epithelium or lumen, cryptdins are potent 
therapeutic agents for infections of the intestine. In 

35 particular, cryptdins are useful where the subject is 
immunocompromised due, for example, to malignancy, 
malnutrition, chemotherapy, radiation, immunosuppressive 
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viruses, autoimmune disease or neonatality . In addition, 
cryptdins are useful in surgical prophylaxis, for 
example, by functioning' to help' sterilize the small 
bowel. Thus, cryptdins can be useful as medicaments for 
5 treating a subject having a pathology characterized, in 
part, by an ■ inflammatory process. 

A cryptdin, either purified from natural 
sources or synthetic, can be administered to a subject in 
need of such therapy by various means, including orally, 

10 preferably in a slow-release type formulation, which will 
avoid release within the stomach. Alternatively, 
cryptdins can be administered through nasogastric 
intubation, transabdominal catheter, intravenously or 
aerosol administration. Individual species of cryptdin 

15 can be administered alone or in combination. Cryptdins 
administered in combination can be administered 
simultaneously or sequentially and can be repeated as 
necessary. 

Prior to the characterization of a mouse • 

20 intestinal defensin cDNA, expression of defensins was 
thought to be limited to professional phagocytes , i . e . , 
neutrophils and macrophages. The presence of high levels 
of cryptdin mRNA in Paneth cells led to the hypothesis 
that defensins synthesized in intestinal epithelium may 

25, contribute to antimicrobial barrier function in the small 
bowel (Ouellette et al,, J. Cell Biol. 108:1687-1695 
(1989a) , which is incorporated herein by reference) . 
Isolation and characterization of six mouse cryptdin 
peptides, two rat cryptdin peptides and 2 human cryptdin 

30 peptides, and the demonstration of antimicrobial activity 
of various cryptdin peptides indicates that the cryptdins 
have an antimicrobial role in the small intestine. The 
immunohistochemical localization of cryptdin (s) to Paneth - 
cells is consistent with previous in sicu hybridization 

35 analysis and suggests that defensins ' produced by these 
cells may contribute to restricting the colonization and 
invasion of the small bowel by bacteria. 
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Inicial efforts to purify intestinal defensins 
focused on the isolation of mouse cryptdin 1 (SEQ ID NO: 
9) , the peptide predicted from the cryptdin cDNA 
sequence. Since the deduced structure of the peptide is' 
5 highly cationic, intestinal peptides were solubilized by 
homogenizing intact mouse jejunum and ileum in 30% formic 
acid. Acid-urea PAGE of the crude extract revealed 
several bands with R { values similar to those of rabbit 
defensin NP-1 and. cryptdin C, a folded synthetic defensin 

.10 congener corresponding to residues 4 to 35 in cryptdin 1 
(SEQ ID NO: 9) . Peptides corresponding to these bands 
were purified approximately 200-fold by sequential gel 
filtration chromatography on Bio-Gel P-60 (Figures 2. A. 
and 2.B.). Electrophoresis of P-60 column fractions on 

15 acid-urea gels showed that five fractions elirting between 
two prominent peaks (Figures 2. A. and 2.B., brackets) 
contained putative cryptdin peptides (Figure 3, lane a) . 
Peptides in these P-60 fractions migrated with an 
apparent molecular mass of approximately 4 kDa on SDS- 

20 PAGE (not shown) , consistent with the molecular weight of 
defensins. Furthermore, treatment of P-60 fraction 
samples with performic acid reduced the electrophoretic 
mobility of the five putative mouse cryptdins in acid- 
urea gels, behavior that is characteristic of defensins 

25 and polypeptides that contain multiple cysteine residues. 

Defensins in pooled P-60 fractions were 
purified further using sequential rounds of RP-HPLC 
utilizing different ion-pair agents. Initial HPLC 
fractionation utilized water- ace tonit rile gradients 

30 containing 0.13% heptaf luorobutyric acid (HFBA) as the 
ion-pairing agent, whereby each of the five peptides 
contained in ' the pooled P-60 fractions, was. resolved to 
near purity in a single run (Figure 2.C.). Complete 
purification of five peptides, mouse cryptdins 1-5 (SEQ 

35 ID NOS: 9 to 13, respectively)/ was' achieved by- 
subsequent RP-HPLC using. 0.1% tri f luoroacet ic acid (TFA) 
(Figure 3, lanes B-F) . Assuming extraction of individual 
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peptides is equally efficient, both acid-urea gel 
electrophoresis and RP-HPLC of the P-60 fractions 
containing putative cryptdins showed that the relative 
abundance of the peptides is cryptdin 1 > cryptdin 2 > 
5 cryptdin 5 > cryptdin 3' > cryptdin 4. The relative 
• amounts of cryptdins. 1-5 {SEQ ID NO: 9 to. 13, 
respectively) have been qualitatively reproducible in 
every preparation of acid-extracted protein from mouse 
small intestine. 

10 Using a modification of the method described 

above, mouse cryptdin 6, rat cryptdin 2, and human 
cryptdins 5 and 6 also were isolated (see Examples I and 
II; see, also, Figures 8 and 9). In addition, longer 
forms of mouse cryptdins 4 and 5 (compare SEQ ID NOS: 12- 

15 and 15; 13 and 16) and rat cryptdin 1 {compare SEQ ID- 
NOS: 14 and 18) were obtained. This result suggests that 
the initial method of purifying cryptdin peptides 
resulted in partial degradation of- the C-termini of some 
peptides. Significantly, both forms of the purified 

20 cryptdin peptides have antimicrobial activity. 

Biochemical characterization of the isolated 
cryptdins demonstrated that these peptides are defensins . 
Amino acid analysis of each peptide showed their 
compositions (cationic peptides of about 30 to 45 amino 

25 acid residues, including 6 half -cysteines) are compatible 
with def ensin-like molecules. The complete sequences of 
mouse cryptdins 1-6 (SEQ ID NOS; 9 to 11 and 15 to 17), 
rat cryptdins 1 and 2 (SEQ ID NOS : 14, 18 and 19) and 
human cryptdins 5 and 6 (SEQ ID NOS : 21 and 22) were 

30 determined by automated Edman degradation and, in some 
cases, by amino acid analysis of carboxyl terminal 
chymotryptic peptides (see Figures 1, 8 and 9) .' The 
primary structures of the cryptdins ■ contain the 
distinctive structural features of human, rabbit, rat and 

35 guinea pig neutrophil defensins (Lehrer et al . , Cell 
64:229-230 (1991a), which is incorporated herein by 
reference), i.e., the six invariant cysteine residues and 
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the glycine and glutamic acid in positions that are 
highly conserved in myeloid defensins. 

The cryptdin peptides disclosed herein contain 
features that are unique and distinct from defensins of 
5 myeloid origin. For example, mouse cryptdins 1, 2, 3 and 
6 (SEQ ID NOS: 9 to 11 and 17, respectively) are almost 
identical, differing only at two or three positions (see. 
Figure 9.A.). Analysis of codons from which these amino 
acid differences could arise shows that the conversion, 

10 ■ for example, of Ser 10 to Lys 10 in cryptdin 1 (SEQ ID NO: 9) 
and cryptdin 3 (SEQ ID NO: JL1) , respectively, requires 
two nucleotide substitutions. On the other hand, single 
nucleotide changes in the codon encoding Thr 10 in cryptdin 
2 (SEQ ID NO: 10) could give rise to cryptdins 1, 3 and 

15 6, suggesting that the cryptdin 2 gene may be an 
intermediate or progenitor of the cryptdin 1, 3- and 6 
genes. Similarly, a. single nucleotide change in the 
codon for Thr 10 of cryptdin 2 can account for the deduced 
amino acid at position' 10 in cryptdins 7-17 (see Figure 

20 10, nucleotides 203-205,- SEQ ID NOS : 34 -50). 

By homology with the structures ' of known 
myeloid defensins, the cryptdin 1 N- terminus was 
predicted to begin at Leu 4 ' or Val s , which is 1-2 residues 
prior to the first conserved cysteine. However, compared 

25 to myeloid defensins, cryptdins have variably extended 
N-termini that contain from three (mouse cryptdin 4, SEQ 
ID NO: 12;- rat cryptdin 1, SEQ ID NO: 14) to nine (human 
cryptdin 5, SEQ ID NO: 21) amino acids preceding the 
first cysteine. In mouse cryptdins 1-3 and 6^17 (SEQ ID 
.30 NOS: 9 to 11, 17 and 23-33 respectively), the N-peptidyl 
extensions consist of two charged internal residues, 
flanked by amino acids with hydrophobic sidechains. 
Since natural variation in defensin amino termini 
correlates with relative antimicrobial potency in vitro 

3 5 (Ganz et al . , J. Clin/ Invest. 76: 14 27- 14 35 (1985), which 
is incorporated herein by reference) , the extended 
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N-termini of- enteric defensins may have evolved for a 
unique role in the bowel. 

Mouse cryptdin 4 (SEQ ID NO: 12) , the most 
cathodal and, apparently, least abundant mouse enteric 
5 defensin, was the first defensin found to contain a chain 
length variation between the fourth and fifth cysteine 
residues. Unlike the majority of previously known 
- defensins, in which nine amino acids separate the fourth 
and fifth cysteines (Lehrer et al., supra, 1991a), mouse 

10 cryptdin 4 (SEQ ID NO: 12) contains only six residues 
between the same two amino acids (Figure 1) . In 
'addition, rat cryptdins 1-3 (SEQ ID NOS : 14 and 18-20) 
contain ten amino acid residues between the fourth and 
fifth cysteines-. These findings indicate the defensin 

15 fold involving this stretch of the peptide chain can 
accommodate significant variability in the size of the 
loop, as compared to the invariant loop size defined by 
crystal and NMR structures, respectively, of human and 
rabbit neutrophil defensins. Also, rat - cryptdins 1-3 

20 (SEQ ID NOS: 14 and 18-20) are the only, cryptdins 
containing three, instead of four, amino acid residues 
between the second and third cysteine residues. 

Since cryptdin mRNA levels increase during 
postnatal development of mouse small bowel (Ouellette et 

25 al., supra, 1989a), it was investigated whether 
accumulation of enteric defensins was regulated 
similarly. Analysis of intestinal acid extracts from 
male and female mice showed that mouse cryptdins 1-3 and 
5 (SEQ ID NOS: 9 to 11 and 13, respectively) are present 

30 in adult mice, regardless of gender. On the other hand, 
extracts from 9 day-old mice lack the peptides, 
consistent with postnatal accumulation of cryptdin MRNA. 

Mouse cryptdins 1-5 (SEQ ID NOS : 9 to 13) were 
purified from intestinal epithelial cells. In the 

35 presence of EDTA , the intestinal epithelium no longer 
adheres to the underlying basement membrane, and floats 
free of the lamina propria upon gentle agitation 
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(Bjerknes and Cheng, Am. J. Anat . 160:51-63 (1981), which 
is incorporated herein by reference) . Preparations of 
epithelial sheets isolated in this manner were 
concentrated by low speed centrif ugation and extracted 
5 with 30% formic acid. Peptides extracted from isolated 
epithelial sheets comigrate with cryptdins 1-5 (SEQ ID 
NOS: 9 to 13) when analyzed by acid-urea PAGE (Figure 5), 
demonstrating their epithelial origin. 

Tmmunoperoxidase staining of full - thickness 

10 sections of small intestine with an anti -cryptdin 
antibody demonstrate cryptdin antigen in Paneth cells, 
consistent with localization of cryptdin mRNA by in situ 
hybridization (Ouellette et al . , supra, (1989a)). 
Incubation of sections of adult mouse jejunum and ileum 

15 with a polyclonal anti-cryptdin IgG produced by rabbits 
immunized with the synthetic congener cryptdin C 
localized the immunoperoxidase reaction to granulated 
cells,, morphologically defined as Paneth cells, at the 
base of every crypt (Figure 6). The staining pattern 

20 accentuates the granular appearance of the cytoplasm in 
these cells and the immunoreact ivity appears particularly 

strong over Paneth cell granules. The' antibody is 

specific for mouse cryptdin(s), since it is negative both 
for rat and human Paneth cells (data not shown) . 

25 Leukocytes in the lamina propria of the villi also were 
negative, suggesting that related enteric defensins are 
not expressed by phagocytes or lymphocytes. Because of 
the extensive similarity of mouse cryptdins 1-3 (Figure 
1,^ SEQ ID NOS: 9 to 11), the polyclonal antibody produced 

30 against cryptdin C probably recognizes the. three 
peptides. Conversely, because mouse cryptdin 4 (SEQ .ID 
NO: 12) and cryptdin 5 (SEQ ID NO: 13) differ markedly 
from the other mouse cryptdins, the anti-cryptdin C 
antibody is unlikely to react with cryptdin 4 (SEQ ID NO:. 

35 -12) and cryptdin 5 (SEQ ID NO: 13), leaving their origin 
in Paneth cells somewhat unresolved. 
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Immunohistochemical data suggest crypcdins are 
secreted into 'the intestinal lumen* Material in the 
small intestinal lumen is strongly positive for the 
antibody but negative for pre- immune sera or IgG (Figures 
5 6. A. and 6.B.). Although the agonist for Paneth cell 
defensin secretion is unknown, ■ lysozyme, another protein 
constituent of Paneth cell granules, is secreted into the 
lumen under cholinergic regulation. Consistent with . 
immunochemical detection of anti -cryptdin C positive 

10 material in the intestinal lumen, acid-urea PAGE of 
saline washes of adult jejunum and ileum contain peptides * 
with mobilities very similar co but distinct from the 
mobility of cryptdins ."( Figure 4). Nevertheless, the 
peptides are not identical to crypcdins 1-5 {SEQ ID NOS : 

15 9 to 13 , respectively) by either migration in acid-urea 
PAGE or by HPLC analysis, suggesting they may correspond 
to cryptdins that have been processed further. 
Conceivably, luminal cryptdin or cryptdin - like material 
could derive from exfoliated Paneth cells in the lumen, 

20 but the low rate of Paneth cell turnover suggests this is 
unlikely. The release of cryptdins or processed variants 

— - into the small bowel by Paneth cells contrasts with the 
apparent lack of defensin secretion by leukocytes , ' and it 
is inferred that a secretory pathway may exist for the 

25 constitutive delivery of defensins into the intestinal 
lumen by Paneth cells. 

The antibacterial activity of purified mouse 
cryptdins 1-5 (SEQ ID NOS: 9-13) was tested against 
wild type, and phoP mutant S. typhimurium using a modified 
■ 30 plate diffusion assay (Lehrer et" al .. , J. Immunol. Methods - 
137:167-173 (1991b), which is incorporated herein by 
reference) . phoP is a . two - component regulatory locus 
that is essential- to S. cyphimuriwn virulence and 
survival within macrophages (Fields et al . , Science 

35 243:1059-1062 (1989); Miller et al . , Proc . Natl . Acad. . 
Sci . , USA 86:5054 -5058 (1989), each of which is 
incorporated herein by reference). Mutants in the phoP 
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locus are particularly sensitive to rabbit defensins NP-1 
and NP-2 when compared to wild type parent strains 
(Fields et al., supra, 1989; Miller et al. ( Infect . 
Immun. 58:3706-3710, (1990), which is incorporated herein 
5 by reference) . 

Under assay conditions using a phosphate buffer 
as described in Example III, the antimicrobial activity 
of rabbit def ensin -WP-1 against wild type and the phoP 
mutant organisms was quite similar (Figure 7.8.) . On the. 

10 other hand, at concentrations of mouse cryptdin 1 (SEQ ID 
NO: 9) that are effective against the attenuated mutant, 
wild type 5. typhimurium is completely resistant to the 
effects of the peptide (Figure 7. A.). 

The differential activity of cryptdin 1 (SEQ ID 

15 NO: 9) against avirulent S. typhimurium suggests that 
resistance to mucosal defensins may be important for the 
evolution of virulence in enteric pathogens. However, in 
experiments using HEPES or PIPES as buffers as described 
in Example III, concentrations of 100 fxq/ml or 300 /ig/ml 

20 cryptdin 1 were as effective as NP-1 in inhibiting the 
growth of wild type 5. typhimurium. Furthermore, at 
these concentrations, cryptdins 4 and 5 were more 
effective 'than NP-1 in preventing the growth of mutant' 
and wild type S. typhimurium (not shown) . 

25 The present invention also provides 

substantially purified nucleic acid sequences encoding 
cryptdins. For example, the cDNA sequences for mouse 
cryptdins 1-17 (SEQ ID NOS : 34 -50) are shown in Figure 10 
and the cDNA sequences for rat cryptdins 1-3 (SEQ ID NOS: 

30 63-65) are shown in. Figures 14. A. to 14. C. In addition, 
the genomic DNA sequences for mouse cryptdins 1, 2, 3, 5 
and 6 (SEQ ID NOS: 53-57) and for an apparently 
inactivated cryptdin gene, cryptdin i (SEQ ID NO: 58) are 
shown in Figure 11 and the genomic DNA sequences for rat 

35 cryptdins 1-3 (SEQ ID NOS: 66-68) are shown in Figures 
15 .A. to 15 .C. 
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The skilled artisan would- recognize that 
various nucleotide substitutions could be made in the 
nucleic acid sequences shown in Figures 10, 11 14 and 15 
without altering the amino acid sequence of the encoded 
5 crypt din peptide due to degeneracy of the genetic code. 
Such nucleotide sequences, which are equivalent to the 
sequences shown in Figures 10, 11, 14 and 15 are 
encompassed within --Srhe . claimed invention. 

The invention also, provides nucleotide 

10 sequences that consist of a portion of a nucleic acid 
sequence as shown in Figures 10, 11, 14 and 15. Such a 
nucleotide sequence can be useful, for example, as a 
probe, which can hybridize under relatively stringent 
conditions to a nucleic acid molecule encoding a cryptdin 

15- peptide. For hybridization, such a nucleotide sequence 
should be at least about 10 nucleotides in length. One 
skilled in the art would know. that appropriate conditions 
for hybridization can be determined empirically or can be 
calculated . based, for example, on the G:C content of the 

20 nucleotide sequence, the length of the sequence and the 
number of mismatches, if any, between the probe and the 
target sequence (see, for example, Sambrook et al., 
Molecular Cloning: A laboratory manual (Cold Spring 
Harbor Laboratory Press 1989), which is incorporated 

25 herein by reference) . 

A nucleotide sequence as described above can be 
detectably labelled by attaching, for example, a 
radioactive label or biotin, or can be unlabelled. A 
labelled or unlabelled sequence also can be used as a 

30 primer for the polymerase chain reaction (PCR; see, for 

example, Erlich, PCR Technology: Principles and 

applications for DNA amplification (Stockto.n Press 1989) , 
which is incorporated herein by reference) . Such a 
sequence can be useful, for example, to- identify a 

35 nucleic acid sequence encoding a cryptdin in a cell. 

A nucleic acid molecule as shown in Figures 10, 
11, 14 and 15 or a nucleotide sequence derived therefrom 
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also can be useful, for example, for preparing a cryptdin 
peptide or a portion of a cryptdin peptide using well 
known methods of recombinant DNA technology. For 
example, the nucleic- acid, sequence can be cloned into an 
5 expression vector such' as a baculovirus vector or a viral 
vector, which can infect. a mammalian cell and. express an 
encoded cryptdin peptide in the cell. Expression from 
such a vector can be useful for producing large amounts 
of a cryptdin, which can be used to treat a subject 

10 having an inflammatory pathology as described herein, or 
for producing a cryptdin directly in a subject. Thus, 
the invention provides vectors containing a nucleic acid 
molecule as shown in Figures 10, 11, 14 and 15 as. well as. 
specific host, cells, in which the vector can propagate or 

15 can express a cryptdin. 

The following examples are intended to 
illustrate but not limit the invention. 

EXAMPLE I 

20 Purification of Enteric Defensins 

Outbred Swiss mice, (Crl : CD-I) ( ICR) BR,. 45 day 
old males (30-35 g) or timed-pregnant dams, were obtained 
from Charles River Breeding Laboratories,. Inc. (North 
Wilmington MA) . In studies of newborn mice, litters were 

25 culled to 8 pups within 12 hr of delivery, Mice were 
housed under 12 hr cycles of light and darkness and had 
free access to food and water. 

Cryptdins were isolated by a modification of 
the method described by Selsted et al . , J. Cell . Biol . 

30 118:929-936 (1992); Ouellette et al . , Infect. ' Immun. 
62:5040-5057 (1994), each of which is incorporated herein 
by reference. Jejunal and ileal intestinal segments were 
excised from 60 mice immediately after carbon dioxide 
euthanasia. The tissue was washed and the lumen was 

35 flushed with ice cold, water prior to homogeni zat ion in 
500 ml ice cold- 30% acetic acid. The homogenate was 
clarified by • centr if ugat ion , lyophilized, dissolved in 
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200 ml 30% acetic acid, clarified by filtration through 
Whatman 541 filter paper and applied to a 10 x 60 cm Bio- 
Gel P-60 column equilibrated with 30% acetic acid. The 
elution .rate was 100 ml/hr. Fractions containing 
5 cryptdins were identified by electrophoresis in acid-urea 
polyacrylamide gels (Selsted and Harwig., Infect. Immun. 
55:2281-2285 (1987), which is incorporated herein by 
reference) . 

Cryptdin-containing fractions were pooled and 

10 lyophilized, then purification was completed by RP-HPLC. 
Initial separation of * mouse cryptdins 2-5 was achieved by 
HPLC on a 1 x 25 cm Vydac C-18 column using a gradient of 
water .and acetonitrile containing 0.13% KFBA. Solvents 
were delivered at 3 ml/min to generate the following 

15 acetonitrile gradient: 0-28% (10 min) ; 28-34% . (20 min); 
and 34-40% (60 min). Cryptdins 1 and 6, which coeluted 
under these conditions, were resolved by C-18 RP-HPLC 
using 0.1% TFA as the ion pair and a 16-21% acetonitrile 
gradient delivered in 35 min at 3 ml/min. To eliminate 

20 traces of residual HFBa/ preparations of cryptdins 2-5 
were subjected to an addition RP-HPLC step using 0.1% 
TFA. All peptides were lyophilized and quantitated by 
amino acid analysis prior to antimicrobial testing. 
Essentially identical methods were used to purify rat a!nd 

25 human cryptdin peptides, except that rat cryptdins were 
isolated from the small intestine of adult . Sprague-Dawley 
rats and human cryptdins . were isolated from a surgically 
resected normal adult human male small intestine. 

30 * EXAMPLE II 

Peptide Characterization 
Amino acid analyses were performed on 6 N HC1 
hydrolysates (150 °C, 2 hr) of unmodified or performic 
acid-oxidized peptides. Hydrolysates were derivatized 
35 with phenylisothiocyanate and the resulting 
phenylthiocarbamyl amino acids were quantitated as 
described previously (Selsted and Harwig, supra, 1987; 
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Selsted et al . , supra, 1992 / Ouellette et al . , FESS Left- , 
304:146-148 (1992), which is incorporated herein by 
reference) . Peptide samples were reduced with 

dithiothreitol (DTT) and pyridyle thylated with 4 -vinyl 
5 pyridine for sequencing (Henschen, In Advanced Methods in 
Protein Microsequence Analysis (Wittmann-Liebold et al . , 
pages 244-255 (1986), which is incorporated herein by 
reference) . Sequence determinations were performed by 
automated Edman degradation on an ABI model 477 system 

10 (Applied Biosystems, Inc.; Foster City CA) with on-line 
PTH amino acid analysis. In some cases, the C- terminus 
of a. cryptdin peptide was confirmed by amino acid 
analysis of chymotryptic peptides. Crypt dins '4 and 5 
also were analyzed by positive- ion fast atom bombardment 

15 mass spectrometry on a VG 7070E-HF instrument (Ouellette 
.et al . , supra, 1994). - 

EXAMPLE III 

Antimicrobial Assavs 

20 Antibacterial activity was measured in an agar 

radial diffusion assay (Lehrer et al . , supra, 1991b) 
using wild type 5. typhimurium (ATCC 10428) or an 
isogenic phoP mutant of S. typhimurium (strain CS015 
phoP102 : :Tnl0d-Cam, Miller et al . , supra, 1989). Cells 

25 were grown to log phase in trypticase soy broth at 37 °C, 
harvested by centrif ugation and resuspended to 1 x 10 7 
colony forming units (CFU) per ml in 10 mM sodium 
phosphate buffer (pH 7.4) . 

A 100 m! aliquot of each organism was mixed 

30 with 10 ml 1% agarose in 0.03% (w/v) trypticase. soy 
medium, 10 mM sodium phosphate (pH 7.4) at 42. °C. 
Five /xl samples of peptide solution were pipetted into 
3 mm diameter wells formed in the agarose with a sterile 
punch. After 3 hr at 37 °c, the inoculated agarose plate 

35 was overlayed with 1% agarose containing 6% trypticase 
soy medium. After 12-16 hr, antimicrobial activity was 
apparent as clear zones surrounding wells loaded with 
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antibacterial samples; the sizes of the clear zones were 
concentration-dependent. 

Cryptdin antimicrobial activity in vitro was 
substantially enhanced in piperazine-N, N' -bis (2-ethane. 
5 sulfonic acid) (PIPES) or in N-2 -hydroxyethylpiperazine- 
N' -2-ethanesulf onic acid (HEPES) as compared to the 
' activity in sodium phosphate. Purified cryptdin peptides 
were dissolved at 3 to 300 ji"g/ml in 0.01% acetic acid and 
activity was examined against E. cali ML35 (ATCC) . In 

10 the radial diffusion assay, 5 /il peptide solution was 
transferred into wells formed in plates of 1% agarose 
buffered with 10 mM PIPES (pH 7.4} and containing l' x 10 s 
log-phase bacterid grown in trypticase soy broth. After 
3 hr at 37 °c, the plates were overlayed with 0.8% 

15 agarose containing 2X trypticase soy broth and incubated 
overnight. The antibacterial activities, of cryptdin 
peptides was .compared with the activity of rabbit 
neutrophil defensin NP-1, which was purified from 
peritoneal exudates as described by Selsted et al . ( J . 

20 Biol. Chem. 26 0:4 579-4 584 (1985), which is incorporated 
herein by reference) . Antibacterial activity was 
determined by measuring the diameter of clearing, around, 
each well and plotted as a function of ■ peptide 
concentration . 

25 As shown in Figure 12, each cryptdin peptide 

produced a dose -dependent zone of clearing, which 
indicates that E. coli growth was inhibited. The 
potencies of the cryptdins-. varied, with cryptdins 1, 3. 
and 6 showing similar activity, which, was about. 3-5x 

30 greater than the activity of cryptdin 2. Cryptdin 5 was 
approximately 5x more active than rabbit NP-1 at 
concentration above 100 (Figure 12 ,C.) "and 

.cryptdin 4 was at least 50x more active than NP-1 when 
compared .at 100 ^ig/ml and 300 jxg/ml (Figure 12. BJ. 

35 These higher concentrations of cryptdins 4 and 5 also 
were more effective than the same concentration of NP-1 
at inhibiting the growth of S. aureus and of wild type 
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and mutant strains of S. typhimurium {not shown) . These 
results demonstrate that various cryptdin peptides can 
inhibit bacterial growth. ' 

In order to determine whether the effect of the 
5 cryptdin peptides against E. coli is bacteriostatic or 
bacteriocidal, bacterial killing was quancitated as a 
function of time. Bactericidal assays were performed by 
incubating 1-2 x 10 s log-phase bacteria in. 10 mN PIPES 
containing 10 ptg peptide/ml. After incubation for 15 or 

10 30 min at 37 °C, aliquots were removed, serially diluted 
and plated on trypticase soy agar. Bactericidal activity 
was quantitated by counting colonies after overnight 
incubation at 37 °C. 

As shown in Figure 13, cryptdins 1 and 3-6 

15 rapidly killed the E. coli cells. In each of these 
cases, survival was reduced to less than 1% after only 15 
min incubation. Cryptdin 2 was the only peptide tested 
that was not bactericidal under the assay condition. 
Cryptdins 2 and 3 differ only at amino acid position 10 

20 (threonine and lysine, respectively) . 

The bactericidal activity of rat cryptdin 1 
also was examined. E. coli ML35 cells, S. aureus 502A 
cells or mutant or wild type S. typhimurium cells were 
incubated with various concentrations of rat cryptdin 1 

25 or rabbit NP-1. Ten /xg/ml rat cryptdin 1 killed about 
90% of the S.. aureus cells and greater than 99% of the 
E. coli and mutant S. typhimurium cells, but was 
relatively ineffective in killing wild type 
5. typhimurium (not shown) . Rat cryptdin 1 was more 

30 effective than NP-1 in killing the E. coli and mutant 
S. typhimurium cells, whereas NP-1 was more effective in 
. killing S. aureus. 

The effect of mouse cryptdins 1-3 and 6 at 
inhibiting the growth of the protozoan, Giardia lamblia, 

35 which is the most common cause of protozoan disease in 
the' human small intestine, also was examined. Briefly, 
trophozoites of the CS clone of Gi^^dia lamblia WB (ATCC 
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30957} were grown to late log phase in TYI-S-33 medium 
containing bovine bile. Free-swimming trophozoites were 
discarded and tubes with attached trophozoites were 
refilled with warm Dulbecco's PBS. Trophozoites were 
5 detached by chilling 10 min on ice, then harvested by 
centrif ugation, resuspended at 2 x 10 7 /ml in 25 mM HEPES 
<pH 7.5) containing 9% (isotonic) sucrose and incubated 
for 2 hr at 37 °C with various concentrations of mouse 
cryptdins 1-3 or 6. Following incubation, trophozoite 

10 viability was determined by trypan blue exclusion. • 

The cryptdin peptides killed Giardia 
trophozoites in a dose -dependent manner (not shown) . In 
particular, 20 ptg/ml of cryptdin 2 or cryptdin 3 reduced 
Giardia growth by greater than 2 orders of magnitude (not 

15 shown) . These results indicate that cryptdins are active 
against a variety of microorganisms. 

EXAMPLE IV 

Ant i- cryptdin Antibody 

20 A polyclonal rabbit antibody was prepared to a 

synthetic analogue of. "cryptdin 1. The peptide, termed 
cryptdin C, corresponding to residues 4-35 in cryptdin 1 
(SEQ ID NO: 9; Figure 1) was synthesized by. solid phase 
chemistry using N°-butoxycarbonyl protection (Kent, Ann . 

25 Rev. Siochem. 57:957-989 (1988), which is incorporated 
herein by reference) . Following cleavage/deprotection of 
synthetic cryptdin C with TFA- trif luorome thanesulf onic 
acid, the peptide was precipitated in ethyl ether and 
dried in vacuo. A 100 mg sample was dissolved in 10 ml 

30 6.0 M guanidine-HCl, 0.2 M Tris-HCl, pH 8.2, containing 
20 mg DTT . The sample was purged with nitrogen, heated 
to 50 °C for 4 hr and diluted 100-fold with deionized 
water, then was dialyzed exhaustively, first against 
0.1 M sodium phosphate (pH 8.2), 20 mM guanidine-HCl, 

35 100 mM NaCl, then against 5% acetic acid. The sample was 
lyophil.ized,. dissolved in 10 ml 5% acetic acid and 
subjected to RP-HPLC on' a 1 x '25 cm Vydac C-18 column. 



QN^OOCID. <WQ 961607SAI J_» 



WO 96/16075 



PCI7US95/13328 



29 

The earliest eluting peak, representing about 0.5% of the 
crude peptide, was- determined by amino acid analysis to 
have the desired composition. 

A sample (1.5 mg) of cryptdin C was supplied, 
5 without conjugation ' to carrier, to Berkeley Antibody 
Company (Berkeley, CA) for immunization of 2 New Zealand 
White rabbits. Serum samples were collected for 12 
weeks, until the anti -cryptdin C titer, determined by 
ELISA, reached about 1:10,000 for each rabbit. IgG was 
10 isolated from antiserum using DEAE Econo-Pac 
chromatography (Bio-Rad; Richmond CA) as described by the 
manufacturer . 

EXAMPLE V 

15 Immunohistochemistrv 

Paraffin sections of formalin- fixed —mouse 
mid-small bowel were deparaf f inized, treated with 1.1% 
hydrogen peroxide for 4 0 min, then washed extensively 
with water followed by PBS. Slides were treated for 20 

20 min at 37 °C with 500 }ig/ml trypsin in PBS, washed twice 
with PBS, and blocked by incubation for 20 min with 5% 
porcine serum. Slides were incubated for 20 min in 
rabbit anti-cryptdin IgG (1:10 dilution relative to serum 
IgG concentration), then washed with blocking serum. 

25 Porcine anti-rabbit IgG was used as linking reagent 
between the primary antibody and rabbit antiperoxidase- 
peroxidase conjugate (Dako; Carpinteria CA) . 
Diaminobenzidine was used as peroxidase substrate and 
parallel incubations were performed using equivalent 

3 0 dilutions of rabbit preimmune IgG as the primary 
antibody . 

EXAMPLE VI 

Preparation of Synthetic Cryptdin 1 
35 This example . provides ' a method for 

synthesizing, purifying and characterizing synthetic 
cryptdin 1. 
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A. Synthesis 

Synthesis was initiated at the 0.13 mmole scale 
using Wang resin coupled to f lourenylmethoxycarbonyl 
(Fmoc) -arginine using an acid labile linker. Synthesis 
5 was carried out • in dimethyl formamid'e (DMF) using 
(relative to resin substitution) a 3 -fold excess of Fmoc- 
amino acids activated in situ with a 3-fold excess of BOP 
(benzotriazol-l-yl-oxy-tris (dimethyl ami no) phosphonium 
hexaf luorophosphate) and HOBt (hydroxybenzotriazole) and 

10 a 6 -fold molar excess of N-methylmorpholine (Nmm) . Fmoc. 
removal during synthetic cycles was achieved using 
cycles of 50% and 25% piperidine in DMF. The side-chain 
protection scheme utilized the following Fmoc-amino 
acids: OtBut-aspartic acid, Pmc-arginine , tBut- tyrosine , 

15 tBut-serine, Trt - cysteine , tBoc-lysine, OtBut-glutamic 
acid, Trt-asparagine , tBut - threonine and Trt -hist idine . 

The peptide chain was assembled in a Synostat 
batch synthesizer using single couplings at all additions 
except at leucine and valine which were double coupled. 

20 The cycle sequence is as follows: 

1. Wash with DMF 4X for 2 min; 

2. Deblock: 50% piperidine IX for 5 min; 

3. Deblock: 25% piperidine IX for 15 min; 

4. Wash with DMF 4X for 2 min; 

25 5. Dissolve amino acids + BOP + HOBt in DMF 

and transfer to reaction vessel; 

6. Add Nmm to RV and mix for 60 min; and 

7. Wash with DMF IX .for 2 min. 

30 After coupling of the amino terminal residue, 

the terminal Fmoc group was removed using the following 
cycle: 

1. Wash with DMF 4X for 2 min; 

2. Deblock: 50% piperidine IX for 5 min; 
35 3. Deblock: 25% piperidine IX for 15 min; 

4. Wash with DMF ■ 4X for 2 min; 

5. Wash with dichloromethane IX for 5 min; 
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6. Wash with isopropanol 4X for 5 min,- 

7. Dry under stream of N 2 IX for 10-20 min; 

and 

8. Dry under vacuum IX for 12 hr. 

5 The peptide-resin was weighed to determine mass 

■increase. To cleave and deprotect the peptide-resin, it 
was first reswelled in dichloromethane , then cleaved and 
deprotected by addiirron of reagent R (90% tri f luoroacetic 
acid, 5% thioanisole, 3% ethanedithiol , 2% anisole) at a 
10- ratio of 10 ml/g peptide-resin. Cleavage/deprotect ion 
was carried out under nitrogen for 18 hr at RT. 

B. Purification 

The cleavage mixture was separated from resin 

15 by. filtration through a scintered glass funnel, washed 
with 1-2 ml fresh reagent R and diluted 5-fold with 50% 
acetic acid. Glacial acetic acid was added to a final 
acetic acid concentration of 50%. The resulting solution 
was extracted 3x with 0.33 vol methylene chloride and the 

20 aqueous phase was lyophilized to dryness, then dissolved 
in 50% acetic acid and relyophil ized . The extraction and 
lyophilization steps were repeated 3-4 times, then the 
dry peptide was dissolved in 30% acetic acid .at a 
concentration of 20 mg/ml and passed over an 800 ml 

25 Sephadex G-10 column equilibrated in 30% acetic acid. 
Pept ide-containing fractions were pooled, lyophilized, 
dissolved in 5% acetic acid, then diluted ten-fold with 
water to a final protein concentration of about 1 mg/ml . 
The solution was adjusted to' pH 8.0 with ammonium 

30 hydroxide and mixed rapidly with a magnetic stirrer at RT 
in a beaker open to room air. The pH was adjusted 
periodically to pH 8.0 over a period of 4 days. The 
solution was then acidified with acetic acid to pH 3.5 
and lyophilized. 

35 C-18 RP-HPLC using 0.1% TFk-water/acetonitr ile. 

gradients was used to purify the folded .peptide. 
Fractions were analyzed on acid-urea gels and compared to 
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natural cryptdin L. The yield from an initial crude 
peptide preparation of 500 mg was approximately 30 mg. 

C. Characterization 
5 Synthetic cryptdin 1 was compared to natural 

peptide on analytical RP-HPLC, .SDS-PAGE and under three 
different conditions on acid-urea PAGE. For analysis on 
acid-urea PAGE, peptide was electrophoresed either 
without modification, after reduction with DTT or after 
10 performic acid oxidation. Under all. conditions 

described, native and synthetic cryptdin 1 behaved 
idencically. The amino acid compositions of natural and 
synthetic cryptdin 1 were indistinguishable. 

15 EXAMPLE VII 

Cloning of Nucleic Acid Molecules Encoding Crvptdins 

Individual crypts were isolated using a 
modification of the EDTA elution method of Bjerknes and 
Cheng, supra, 1981, as described by Cano-Gauci et al . , 
20 Expt . Cell Res . 208:344-349 (1993), which is incorporated 
herein by reference. Briefly, the central 10 cm of small 
intestine from an adult C3H/HeJ mouse was everted on a 
Buchler gradient -making apparatus, then intact crypts 
were dislodged by vibration in ice cold 30 mM EDTA in 
25 calcium-free, magnesium-free PBS. Isolated crypts were 
disrupted in a sonicating water bath prior to cDNA 
- synthesis . 

The crypt library was constructed by mRNA- 
• directed PCR amplification (Cano-Gauci et al . , supra, 
• 30 1992) . Phage were screened at a density of approximately 
300 PFU/dish using the partial cDNA clone, asb4/134, as 
a probe (Ouellette et al . , supra, 1989a) . Positive phage 
were collected and denatured plasmid cDNA was sequenced 
by the dideoxynucleot ide termination method using 
35 Sequenase™ (U.S. ' Biochemical Corp.; Cleveland OH). 
Sequencing primers included T3 and T7 promoter primers 
and Defcr pLi0 , which is a 16-mer that corresponds to 
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nucleotides 90-105 in cryptdin 1 mRNA (Huttner et al . , 
Genomics 19: 448-453 (1994) , which is incorpo raced herein 
by reference) . Reaction mixtures were separated by 
electrophoresis in gels consisting of 5% Long Ranger™ (AT 
5 Biochem, Inc.; Malvern PA) and DMA sequence data were 
analyzed (Ouellette et al., supra, 1994). Computations 
for similarity searches of DNA sequences in nonredundant 
nucleic acid "and protein sequence databases were 
performed at the National Center for Biotechnology 

10 Information with the BLAST network service (Ouellette et 
al . , supra, 19 94) . 

A cDNA . library also- was prepared by 
amplification of cryptdin mRNA (Huttner et al., supra, 
1994) . Total RNA was isolated from the small intestine 

15 of a male 129/SVJ mouse using RNazol™ (Biotecx Lab; 
Houston TX) . First strand cDNA synthesis was performed 
using the cDNA Cycle Kit ( Invitrogen; San Diego CA) . 
Amplification of 5' ends was performed using the 5' RACE 
method {Frohman et- al . , Proc. Natl. Acad. Sci., USA 

20 85:8998-9002 (1988),. which is incorporated herein by 
reference) with a reverse primer that was specific for a 
conserved region of the cryptdin 3 ' -untranslated sequence 
(UTS) . 

Blot hybridization of the PCR products using an 
25 oligonucleotide probe specific for the cryptdin prepro- 
coding -region detected a single band. DNA from the band 
was isolated using the Geneclean II™ kit (BiolOl; La 
Jolla CA) , subclone.d into the Blues.cript II plasmid using 
the pCR-Script SK ('+) cloning - kit (Stratagene) and 
30 transfected into competent XL-1 Blue cells (Stratagene) . 
Colonies containing cryptdin-related sequences were 
identified by hybridization to a labelled asb4/134 probe. 
DNA sequence analysis of the positive clones was 
performed as described above, except that internal 
35 primers were utilized as required. 

Using these methods, cDNA sequences encoding 17 
distinct mouse cryptdin peptides were identified {Figure 
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10; SEQ ID NOS: 34-50).. The various mouse cryptdin cDNA 
sequences share 93-100% nucleotide sequence identity with 
cryptdin 1, except cryptdin 5 and cryptdin 4 share 73% 
and 69% sequence identity, respectively; with cryptdin 1. 
5 The amino acid sequences were deduced from the. 

cDNA sequences for the 17 mouse cryptdins (see Figure 
9. A.; SEQ ID NOS: 9-11, 15, 16, 17 and 23-33) . As shown 
in Figure 9. A., the cDNA sequences encode prepro-cryptdin 
peptides consisting of a signal peptide, a propiece and 

10 the cryptdin peptide. The prepro-cryptdins , including 
the mature cryptdin peptide, share significant amino acid 
sequence identity with cryptdin 1, although cryptdins 4 
and 5 are less homologous (Figure 9.B.). Amino acid 
variability was most striking at position 10 of the 

15 mature cryptdin peptide, where either serine*, threonine,, 
alanine, isoleucine or lysine can be found. 
Interestingly r a single nucleotide change in the sequence 
of cryptdin 2 can account for each of these amino acids. 
In addition, position 15 can contain arginine or lysine. 

20 The amino acid variability among cryptdin peptides can .be 
involved in conferring different antimicrobial properties 
to the cryptdins. 

Mouse cryptdin genomic clones also were 
obtained and sequenced (Huttner et al . , supra, 1994). 

25 Asb4/134 was used as a probe to screen a custom-made . 
129/SVJ mouse genomic library constructed in lambda DASH 
II (Stratagene .Cloning Systems, Inc.; La Jolla CA) . 
Approximately 1 x 10 s phage were screened in duplicate and 
25 positive phage were identified. Ten clones were 

30 purified and phage DNA was isolated using Qiagen 100 
columns (Qiagen, Inc.; Chatsworth CA) . Southern blots of 
. Eco RI -digested DNA from individual phage were hybridized 
to asb4/134 and hybridizing fragments were subcloned into 
Bluescript II SK*™ (Stratagene) or pUC18 (BRL; 

35. Gaithersburg MD) for sequencing. 

Sequencing was performed as described above, 
except that primers were selected based on the cryptdin 
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1 cDNA sequence and with the expectation that mouse 
cryptdin genes would be structurally homologous to the 
rabbit MCP-1 and MCP-2 defensin genes (see Huttner et 
al., supra, 1994} . DNA sequence data were analyzed using 
5 the programs of Staden ( Biochem. Soc . Trans. 12:1005-1008 
(1934) and the University of Wisconsin Genetics Computer 
Group (Devereux et al . , Nucl . -Acids Res. 12:387-395 
(1985)) . Searches for homology were performed as 
described above . 

10 As shown in Figure 11, screening of the genomic 

library produced nucleic acid sequences that contained 
the complete coding sequences for mouse cryptdins 1, 2, 
3, 5 and 6 (SEQ ID NOS : 53 --57") . In addition/ a. 
homologous gene, designated cryptdin i (Crypi; SEQ ID 

15 NO: 58), which apparently was inactivated due to a point 
mutation that changed a cysteine codon to an in"- frame 
stop codon, was isolated. Examination of the nucleic 
acid sequences revealed that the cryptdin genes contain 
two exons, the first of which codes for the 5' -UTS and 

20 .the prepro-coding region and the second of which encodes 
the mature cryptdin peptide and the 3 ' -UTS (not shown; 
but see Figure 11. A.). A similar structure has been 
"described for the human cryptdin genes (Jones and Bevins, 
J. Biol . Chem. 267:23216-23225 (1992)). 

25 Similar methods as described above were used to 

obtain the cDNA sequences encoding rat cryptdins 1-3 
(Figures 14. A. to 14. C, ; SEQ ID NOS: 63-65, 
respectively) , except that RNA was obtained from the 
small intestine of Sprague-Dawley rats. In addition, 

30 genomic DNA sequences encoding rat cryptdins 1-3 (Figures 
15. A. to 15. C; SEQ ID NOS: 66-63, respectively) were 
obtained using methods as described above, except 1 that a 
genomic library containing Sprague-Dawley DNA cloned in 
EMBL3 was purchased from Clontech (Palo Alto CA) . 

35 Although the invention has been described with 

reference to the disclosed embodiments, it" should be 
understood that various modifications can be made without 
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departing from the spirit of the invention. Accordingly, 
the invention is limited only by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: THE REGENTS QF THE UNIVERSITY OF. 

CALIFORNIA ' 

TITLE OF INVENTION: Antibiotic Cryptdin Peptides and Methods 
of Their Use 

(Hi) NUMBER OF SEQUENCES: 70 

(iv) CORRESPONDENCE A00RESS: 

(A) ADDRESSEE: Rabbins, Berliner & Carson 
(8) STREET: 201 N . Figueroa Street, 5th Floor 
(C) CITY: Los Angeles 
(0) STATE: California 
<E) COUNTRY : USA 
(F) ZIP: 90012-2628 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

CO OPERATING SYSTEM: PC-0OS/MS-0OS 

(0) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
<A) APPLICATION NUMBER: 

(B) FILING. OATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION OATA: 

(A) APPLICATION NUMBER: US 07/930,649 
CB> FILING OATE: H-AUG-1992 

(vii) PRIOR APPLICATION OATA: 

(A) APPLICATION NLWBER: US 07/339,020 
(3) FILING DATE: 26-MAY-1992' 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Berliner, Robert 

(B) REGISTRATION NUMBER: 20,121 

(C) REFERENCE/DOCKET NUMBER : 5555-339C1 -XPC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 977-1001 

(B) TELEFAX: (213) 977-1003 



(2) INFORMATION FOR SEQ 10 NQ:1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: U amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO:1: 

Leu Ser Lys Lys 

1 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2; 

Gly tie Arg Phe Leu Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Arg Asn Leu Phe Leu Thr .Phe Vat Phe 
1 5 

(2) INFORMATION FOR SEQ 10 NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:<»: 

Arg Arg Gly His Leu He: Tyr Thr Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTER 1STICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 6 

(0) OTHER INFORMATION: /notes "Xaa = Amino acid is 
independently L or M." 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 9 

(0) OTHER INFORMATION: /notes "Xaa * Amino acid is 
independently L or M M 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:5: 

Arg Lys Gly His Leu Xaa Tyr Thr Xaa 
1 5 

(2) INFORMATION FOR SEQ 10 HO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:6:- 

Leu Arg Asp Leu Val Cys Tyr Cys Arg ser Arg Gty Cys Lys Gly Arg 
1 5 .10 15 
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Glu Arg Net Asn Gly Thr Cys Arg Lys Gly His Leu Leu Tyr Thr Leu 
20 25 , 30 



Cys Cys Arg 
35 



(2) INFORMATION FOR SEC 10 MO:7: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEO ID NO:7: 

Leu Arg Asp Leu Val Cys Tyr Cys Arg Thr Arg Gly Cys Lys Arg Arg 
1 5 10 15 

Glu Arg Met Asn Gly Thr Cys Arg Lys Gly His Leu Me: Tyr Thr Leu 
20 25 30 



Cys Cys Arg 
35 



(2) INFORMATION FOR SEQ 10 NO:8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 amino acids 
(8) TYPE: amino acid- 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Leu Arg Asp Leu Val Cys Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg 
1 5 10 15 



Glu Arg Met Asn Gly Thr Cys Arg Lys Gly His Leu Met Tyr Thr Leu 
20 25 30 



Cys Cys Arg 
35 

. C2) INFORMATION FOR SEQ ID N0:9: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 31 amino acids 

CB) TYPE: amino acid 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO:9: 

Gly Leu Leu Cys Tyr Cys Arg Lys Gly His Cys Lys Arg Gly Glu Arg 
15 10 15 



Val Arg Gly Thr Cys Gly lie Arg Phe Leu Tyr Cys Cys Pro Arg 
20 25 30 
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(2) INFORMATION -FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 35 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: Linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID MO; 10: 

Leu Ser Lys Lys Leu Ue Cys Tyr Cys Arg lie Arg Gly Cys Lys Arg 
15 10 15 



Arg Glu Arg Val Phe Gly Thr Cys Arg Asn Leu Phe Leu Thr Phe Val 
20 25 30 



Phe Cys Cys 
35 

(2) INFORMATION FOR SEQ 10 NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO: 11: 

Leu Lys Gin Cys His Cys Arg Lys Phe Cys Arg Pro Tyr Glu Lys Ala 
1 5 10 15 



Glu Gly Ser Cys Arg Pro Gly Leu Phe He Lys Arg Lys tie Cys Cys 
20 25 30 



Ue Gin Gin Trp Thr Pro Gly 
35 

(2) INFORMATION FOR SEQ 10 N0:12: 

(i) SEQUENCE CHARACTERISTICS': 
(A) LENGTH: 32 amino acids 
(8) TYPE: amino acid 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 N0:12: 

Gly Leu Leu Cys Tyr Cys Arg Lys Gly His Cys Lys Arg Gly Glu Arg 
15 10 15 



Val Arg Gly Thr Cys Gly Ue Arg Phe Leu Tyr Cys Cys Pro Arg Arg 
20 25 30 



C2) INFORMATION FOR SEO 10 NO:H: 

C i ) SEQUENCE CHARACTER 1ST ECS : 
CA) LENGTH: 36 amino acids 
(3 ) TYPE: amino acid 
CO) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 13: 

Leu Ser Lys Lys Leu lie Cys Tyr Cys Arg lie Arg Gly Cys Lys Arg 
1.5 10 15 



Arg Glu Arg Val Phe Gly Thr Cys Arg Asn Leu Phe Leu Thr Phe Val 
20 25 30 



Phe Cys Cys Ser 
35 

(2) INFORMATION FOR SEQ ID NO:K: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE : amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:K: 

Leu Arg Asp Leu Val Cys Tyr Cys Arg Ala Arg Gly Cys Lys Gly Arg 
15 10 15 



Glu Arg Met Asn Gly Thr Cys Arg Lys Gly His Leu Leu Tyr Met Leu 
20 25 30 



Cys Cys Arg 
35 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
(S) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:15: 

Leu Lys Gin Cys His Cys Arg Lys Phe Cys Arg Pro Tyr Glu Lys Ala 
15 10 15 



Glu Gly Ser Cys Arg Pro Gly Leu Phe lie Lys Arg Lys lie Cys Cys 
20 25 30 



He Gin Gin Trp Thr Pro Gly Arg Thr 
• 35 " 40 

(2) INFORMATION FOR SEQ 10 HO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE : amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:16: 

lie Gly Arg Pro Val Arg Arg Cys Arg Cys Arg Ala Asn Cys Gly Pro 
15 10 15 



Lys Glu Tyr Ala Thr Ala Phe Cys Ala Gin Gly Pro Phe Lys, Gin Phe 
20 25 30 
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.Lys Phe Cys Cys Thr 
35 

(2) INFORMATION FOR S£Q 10 MO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 amino acids 
(8) TYPE: amino acid 
CO) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ tD HO: 17-: 

lie Arg Trp Pro Trp Lys Arg Cys His Cys Arg Ser Phe Cys Arg Pro 
1 5 10 15 



Tyr Clu Asn Ala Thr Ser Phe Cys- Ala Gin Gly Leu Phe Lys Gin His 
20 2S 30 



Lys Phe Cys Cys Leu Asp Thr Trp Pro Pro Arg Met Lys 
35 40 45 

(2) IN FORMA TION FOR SEQ ID NO: 15: 

<i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 amino acids 
CB) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Thr Ser Gly Ser. Gin Ala Arg Ala Thr Cys Tyr Cys Arg Thr Gly Arg 
1 5 '10 15 



Cys Ala Thr Arg Glu Ser Leu Ser Gly Val Cys Glu lie Ser Gly Arg 
20 25 30 



Leu Tyr Arg Leu Cys Cys Arg 
35 

(2) INFORMATION FOR SEQ 10 NO: 19: 

<i> SEQUENCE CHARACTERISTICS; ■" 
(A) LENGTH; 32 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Phe Thr Cys His Cys Arg Arg Ser Cys Tyr Ser Thr Glu Tyr Ser 
1 5 10 .15 



Tyr Gly Thr Cys Thr Val Met Gly lie Asn His Arg Phe Cys Cys Leu 
20 25 ■ 30 



(2) INFORMATION FOR SEa ID NQ:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 
CS) TYPE: amino acid 
(0) TOPOLOGY: linear 
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C*i) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5 10 15 



Val Gin Ala Asp Pro He Cln Glu Ala Glu Glu Glu Thr Lys Thr Glu 
20 25 30 



Glu Gin Pro Ala Asp Glu Asp Gin Asp Val Ser Val Ser Phe Glu Gly 
35 40 45 



Pro Glu Pro Ser Ala Leu Gin Asn Leu Glu lie Gly Trp Pro Leu Lys 
50 55 60 



Gin Cys His Cys Arg Lys Phe Cys Arg Pro Tyr Glu Lys Ala Glu Gly 
65 70 75 80 



Ser Cys Arg Pro Gly Leu Phe He Lys Arg Lys lie Cys Cys tie Gin 
85 90 95 



Gin trp Thr Pro Gly Arg Thr 
100 

(2) INFORMATION FOR SEQ 10 NO:21: 

(j) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 96 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO:21 : 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Val Ala Tyr Gin 
■1 5 10 15 



Val Gin Ala Asp Pro lie Gin Gly Ala Glu Glu Glu Thr Lys Thr Glu . 
20 25 30 



Glu Gin Pro Ser Asp Glu Asp Gin Asp Val Ser Val Ser Phe Glu Gly 
35 40 45 



Pro Glu Ala Ser Ala Leu Gin Asp Phe Glu He Gly Arg Pro Val Arg 
50 55 60 



Arg Cys Arg Cys Arg Ala Asn Cys Gly Pro Lys Glu Tyr Ala* Thr Ala 
65 70 75 80 



Phe Cys Ala Gin Gly Pro Phe Lys Gin Phe Lys Arg Phe Cys Cys Thr 
85 90 95 



(2) INFORMATION FOR SEQ 10 NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 
(8) TYPE: amino acid 
CD) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:22: 

Met lys Thr Leu VaL Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
15 10 15 



Me Gin Ala Asp Pro lie Gin Glu Ala Glu Glu Glu Thr Lys Thr Clu 
. 20 25 30 



Glu Gin Pro Ala Asp Glu Asp Gln.Asp Val Ser Val Ser Phe Glu Gly 
35 CO 45 



Pro Glu Pro Ser Ala Leu Gin Asn Leu Glu lie Arg Trp Pro Trp Lys 
50 55 60 



Arg Cys His Cys Arg Ser Phe Cys Arg Pro Tyr Glu Asn Ala Thr Ser 
65 , 70 75 80 



Phe Cys Ala Gin Gly Leu Phe Lys Gin His Lys Phe Cys Cys Leu Asp 
85 90 95 



Thr Trp Pro Pro Arg Met Lys 
100 

<2> INFORMATION FOR SEQ 10 NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE; amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ale Phe Gin 
15 10 15 



Val Gin Ala Asp Pro lie Gin Asn Thr Asp Glu Glu Thr lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 - 60 



Tyr Cys Arg Ser Arg Gly Cys Lys Gly Arg Glu Arg Met Asn Gly Thr 
65 .70 .75 80 



Cys Arg' Lys Gly His Leu Leu Tyr Thr Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ 10 NO:24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 
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.Cxi) SEQUENCE DESCRIPTION; SEQ 10 NO:24: 

Met lys Thr Leu lie Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5 10 15 

Val Gin Ala Asp Pro lie Gin Asn Thr Asp Gig Glu Thr Lys Thr Glu 
20 25 . 30 

Lys Gin Pro Gly Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 

Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 

Tyr Cys Arg Thr Arg Gly Cys Lys Arg Arg Glu Arg Met Asn Gly Thr 
65 70 75 SO 

Cys Arg Lys Gly His Leu..Met Tyr Thr Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ 10 NO:25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 

CB) TYPE: amino acid _ 
(0) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ 10 NO:25: 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
15 10 15 



Val Gin Ala Asp Pro He Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu Arg Met Asn Gly Thr 
65 70 75 80 



Cys Arg Lys Gly His Leu Met Tyr Thr Leu Cys Cys Arg 
35 90 

C2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 92 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 79 

(0) OTHER INFORMATION: /notes "Xaa s Amino acid is 
independently L or M." 

Cix) FEATURE: 
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(A) NAME/KSY; Peptide 
(3) LOCATION: 80 

(0) OTHER INFORMATION: /note* "Xaa * Amino acid is 
independently L or M." 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
" (B) LOCATION: 82 

(0) OTHER INFORMATION : /note's «xaa a Amino acid is 
independently L or M . " 



Cxi 3 SEQUENCE DESCRIPTION: SEQ 10 NO:26: 

Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin Val Gin Ala 
15 10 15 



Asp Pro lie Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu Glu Gin Pro 
20 25 30 



Gly Glu Glu Asp Gin Ala Val Ser lie Ser Phe Gly Gly Gin Glu Gly 
35 40 45 



Ser Ala Leu His Glu Lys Ser Leu Arg Gly Leu Leu Cys Tyr Cys Arg 
50 55 60 



Lys Gly His Cys Lys Arg Gly Glu Arg Val Arg Gly Thr Cys Xaa Xaa 
65 70 75 SO 



Gly Xaa lie Arg Phe Leu 7yr Cys Cys Pro Arg Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 



<xi) SEQUENCE 0ESCRIPTIQN: SEQ 10 NO:27: 

Met Lys Thr Phe Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5 10 15 



Val Gin Ala Asp- Pro tie His Lys Thr Asp Glu Glu Thr Asn Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser lie Ser Phe Gly Gly 
35 40 C5 



Gin Glu Gly Ser Ala Leu His Glu Glu Leu Ser Lys Lys Leu lie Cys 
50 55 60 



Tyr Cys. Arg tie Arg Gly Cys Lys Arg Arg Glu Arg Val Phe Gly Thr 
65 70 75 80 



Cys Arg Asn Leu Phe Leu Thr Phe Val Phe Cys Cys Ser 
85 90 
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(2) INFORMATION FOR SEQ 10 NO:2B: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
<B) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



He: Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5 10 15 



Val Gin Ala Asp Pro lie Gin Asn Thr Asp Glu Glu Thr Lys Thr Gtu 
20 25 30 



Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 • 



Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Ala Arg Gly Cys Lys Gly Arg Glu Arg Met Asn Gly Thr 
65 7 0 75 80 



Cys Arg Lys Gly His Leu Leu Tyr Met Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ 10 NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Met Lys Thr Leu lie Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5 10 15 



Val Gin Ala Asp Pro He Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu ■ 
20 25 30 



Glu Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Thr Arg Gly Cys Lys Arg Arg Glu His Met Asn Gly Thr 
65, 70 75 • o0 



Cys Arg Lys Gly His Leu Met Tyr Thr Leu. Cys Cys Arg 
35 90 ' 

(2) INFORMATION FOR SEQ 10 NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 
(8) TYPE: amino acid 
CD) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEO [0 NO:30: 

Met Lys Thr Leu Vat Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
15 10 15 



val Gin Ala Asp Pro lie Gtn Asn Thr Asp Glu Glu Thr Lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 .40 45 



Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu His Mer Asn Gly Thr 
65 70 75 80 



Cys Arg Lys Gly His Leu Leu Tyr Met Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEO ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 81 amino acids 
(8) TYPE: amino acid 
(0) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO:31: 

Leu Ala Phe Gin Val Gin Ala Asp Pro He Gin Asn Thr Asp Glu Glu 
15 10 15 



Thr Lys Thr Glu Glu Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val 
20 25 30 



Ser Phe Gly Asp Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg 
35 40 45 



Asp Leu Val Cys Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu His 
50 55 60 



Met Asn Gly Thr Cys Arg Lys Gly His Leu Met Tyr Thr Leu Cys Cys 
65 70 75 80 



Arg 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUEHCE CHARACTERISTICS: 
(A) LENGTH: 92 amino acids 
(8) TYPE: amino acid 
"(0} TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:32: 

Lys Thr Leu Val Leu Leu Ser Ala leu Val Leu Leu Ala Phe Gin Val 
15 10 15 
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Gin Ala Asp Pro lie Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu Glu 
20 25 30 



Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val Ser Phe Gly Asp Pro 
35 40 . 45 



Glu Gly Sep Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys Tyr 
50 55 60 



Cys Arg Lys Arg Gly Cys LyiJLLy Arg Glu Arg Her Asn Gly Thr Cys 
65 70 75 80 



Arg Lys Gly His Leu Leu Tyr Thr Leu Cys Cys Arg 

85 ... 90 

INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 85 amino acids 

( B) TYPE: amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Ala Leu Val Leu Leu Ala Phe Gin Val Gin Ala Asp Pro Ue Gin. Asn 
15 10 15 



Thr Asp Glu Glu Thr Lys Thr Glu Glu Gin Pro Gly Glu Glu Asp Gin 
20 25 30 



Ala Val Ser Val Ser Phe Gly Asp Pro Glu Gly Thr Ser Leu Gin Glu 
35 40 45 



Glu Ser Leu Arg Asp Leu Val Cys Tyr Cys Arg Ser Arg Gly Cys Lys 
50 55 60 



Gly Arg Glu Arg Met Asn Gly Thr Cys Arg Lys Gly His Leu Leu Tyr 

65 70 . 75 . 80 



Met Leu Cys Cys Arg 
85 

(2) INFORMATION FOR SEQ 10 NO:34: 

Ci) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 93 amino acids 
(3) TYPE; amino acid 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:34: 

Mec Lys Thr Leu lie Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
1 5' 10 15 



Val Gin Ala Asp Pro lie Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu 
20 25 30 
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Glu Gin Pro Gly<Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 .55 60 



Tyr Cys Arg Ala Arg Gly Cys Lys Gly Arg Glu Arg Met Asn Gly Thr 
65 70 75 30 



Cys Arg Lys Gly His leu Met Tyr Thr Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ 10 N0:35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
(3) TYPE: amino acid 
(0) TOPOLOGY: linear 



Cxi ) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 

1 5 10. 15 



Val Gin Ala Asp Pro He Gin Asn Thr Asp Glu Glu Thr lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu- Val Cys 
50 55 60 



Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu His Met Asn Gly Thr 
65 70 75 SO 



Cys Arg Arg Gly His leu Met Tyr Thr Leu Cys Cys Arg 
as 90 

(2) INFORMATION FOR SEQ 10 NQ:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:36: 

Ala Leu Val Leu Leu Ala Phe Gin Val Gin Ala Asp Pro lie Gin Asn 
15 10 15 



Thr Asp Glu Glu Thr Lys Thr Glu Glu GlnPro Gly Glu Glu Asp Gin 
20 25 30 



Ala Val Ser Val Ser Phe Gly Asp Pro Glu Gly Ser Ser Leu Gin Glu 
3S 40 45 



Gtu Ser Leu Arg Asp Leu Val Cys Tyr Cys Arg Thr Arg Gly Cys Lys 
50 55 60 
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Arg Arg Glu Arg Met Asn Gly Thr Cys Arg lys Gly His Leu Met His 
65 70 75 80 



Thr Leu Cys Cys Arg 
35 

(2) INFORMATION FOR SEQ 10 N0:37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
(3) TYPE: amino acid 
(0) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:37: 

Met Lys Thr Leu Val Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 

.1 5 10 15 



Val Gin Ala Asp Pro lie Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Asp Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Ser Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu His lie Asn Gly Thr 
65 70 75 80 



Cys Arg Lys Gly His Leu Leu Tyr Met Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ 10 NO:38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 amino acids 
(8) TYPE: amino acid 
CO) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:38: 

Met Lys Thr Leu tie Leu Leu Ser Ala Leu Val Leu Leu Ala Phe Gin 
■ 1 5 10 15 



Val Gin Ala Asp Pro tie Gin Asn Thr Asp Glu Glu Thr Lys Thr Glu 
20 25 30 



Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser Val Ser Phe Gly Asp 
35 40 45 



Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu Arg Asp Leu Val Cys 
50 55 60 



Tyr Cys Arg Ser Arg Gly Cys Lys Gly Arg Glu Arg Met Asn Gly Thr 
65 70 75 80 
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Cys Arg Lys Gly His Leu Met Tyr Thr Leu Cys Cys Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: fl2 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: I inear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NQ:39: 

Leu Leu Ala Phe Gin Val Gin Ala Asp Pro He Gin Asn Thr Asp Glu 
1 5 10 15 



Glu Thr Lys Thr Glu Glu Gin Pro Gly Glu Glu Asp Gin Ala Val Ser 
20 25 30 



Val Ser Phe Gly Asp Pro Glu Gly Thr Ser Leu Gin Glu Glu Ser Leu 
35 40 45 



Arg Asp Leu Val Cys Tyr Cys Arg Lys Arg Gly Cys Lys Arg Arg Glu 
50 55 60 



His Met Asn Gly Thr Cys Arg Lys Gly His Leu Leu Tyr Thr Leu Cys 
65 70 75 B0 



Cys Arg 



(2) INFORMATION FOR SEQ 10 NO:40: 

<i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 422 base pairs 
(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(0) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



ACACATTGAG 


CTCCTGCTCA CCAATCCTCC AGGTGACTCC CAGCCATGAA GACACTAGTC 


60 


CTCCTCTCTG 


CCCTTGTCCT GCTGGCCTTC CAGGTCCACG CTGATCCTAT CCAAAACACA 


120 


GAT GAA GAGA 


CTAAAACTGA GGAGCAGCCA GGGGAAGACG ACCAGGCCGT ATCTGTCTCC 


ISO 


TTTGGAGACC 


CAGAAGGCAC TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT 


240 


TGTAGATCAA 


GAGGCTGCAA AGGAAGAGAA CGCATGAATG CGACCTGCAC AAAGGGTCAT 


300 


TTATTGTACA 


CGCTCTGCTG TCGCTGAACA TGGAGACCAC AGAGGACAAG ACGAACATGA 


36Q 


GTACTGAGGC 


CACTGATGCT GGTGCCTGAT GACCACTTCG CAATAAATTG TTCGCAATAT 


420 


GC 




422 



(2) INFORMATION FOR SEQ 10 NO:4t: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 422 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: linear 
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( x i ) SEQUENCE DESCRIPTION: SEo'lD NO:41: 

ACACACTCAC CCGCTACTCA CCAATCCTCC AGGTGACTCC CAGCCATGAA GACACTAATC 60 

CTCCTCTCTG CCCTCGTCCT GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAATACA 120 

GATGAAGAGA CTAAAACTGA GAAGCAGCCA GGGGAAGAGG ACCAGGCCGT ATCTGTCTCC 1S0 

TTTGGAGACC CAGAAGGCTC TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT 240 

TGTAGAACAA GAGGCTGCAA AAGAAGAGAA CGCATGAATG GGACCTGCAG AAAGGGTCAT 300 

TTAATGTACA CGCTCTGCTG TCGCTGAACA TGGAGACCAC AGAGGACAAG ATGACCATGA 360 

GTACTGAGGC CACTGATGCT GGTGCC7GAT GACCACTTCG CAATAAATTG CTTGCAATAT 420 

GC 422 

(2) INFORMATION FOR SEQ 10 MO:42: 

(i> SEOUENCE CHARACTERISTICS; 
(A) LENGTH: 422 base pairs 
.•(B) TYPE: nucleic acid 
(C) STRANOEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEOUENCE DESCRIPTION: SEQ 10 NO:42: 

ACACATTGGG CTCCTGCTCA CCAATCCTCC AGGTGACTCC CAGCCATGAA GACACTAGTC 60 

CTCCTCTCTG CCCTCGTCCT GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA 120 

GATGAAGAGA CTAAAACTGA GGAGCAGCCA GGGGAAGACG ACCAGGCCGT ATCTGTCTCC IflO 

TTTGGAGACC CAGAAGGCTC TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT 240 

TGTAGAAAAA GAGGCTGCAA AAGAAGAGAA CGCATGAATG GGACCTGCAG AAAGGGTCAT 300 

TTAATGTACA CACTCTGCTG TCGCTGAACA TGGAGACCAC AGAGGACAAG ACGAACATGA 360 

GTACTGAGGC CACTGATGCT GGTGCCTGAT GACCACTTCG CAATAAATTG TTCGCAATAT 420 

GC 422. 

(2) INFORMATION FOR SEQ ID N0:43: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 365 base pairs 
' <B> TYPE; nucleic acid 
.(C) STRANOEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:43: 

ACTAGTCCTC CTCTCT.GCCC TCGTCCTGCT GGCCTTCCAG GTCCAGGCTG ATCCTATCCA 60 

AAATACAGAT GAAGAGACTA AAACTGAGGA GCAGCCAGGG GAAGAGGACC AGGCCGTATC 120 

TGTCTCCTTT CGAGACCCAG AAGGCTCTGC TCTTCATGAA AAATCTTTGA GAGGTTTGTT 180 

ATGCTATTGT AGAAAAGGAC ACTGCAAAAG AGGAGAACGA GTTCGTGGGA CTTGTGGAAT 240 

ACGATTTTTG TACTGCTGCC CCCGCCGCTG AACATGCAGA TGACAAACAT ATGACAACCA 300 

TTGTCTCTCA GGCCGCTGAT GCCGGGGCCT CATGACCACT TCTCAAGAAA TGTTTGCAAT 360 

ATGCA 365 
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(2) INFORMATION FOR SEQ ID N0:44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 421 base parrs 
.(3) TYPE : nucleic acid 
(C) STRANQEDNESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE OESCRfPTlON; SEQ 10 NO:44; 



ACACATTGGG CTCCTGCTCA 


CCAATTCTCC 


AGGTGACCCC CAGCCATGAA GACATTTGTC 


60 


CTCCTCTCTG CCCTTGTCCT 


GCTGGCCTTC 


CAGGTCCAGG CTGATCCTAT 


CCACAAAACA 


120 


GATGAAGAGA CTAATACTGA 


GGAGCAGCCA 


GGGGAAGAGG ACCAGGCCGT 


ATCTATCTCC 


180 


TTTGGAGCCC AAGAAGGGTC 


TGCTCTTCAT 


GAGGAATTGT CAAAAAAGCT 


GATATGCTAT 


240 


TGTAGAATAA GAGGCTGCAA 


AAGAAGAGAA 


CGCGTTTTTG GGACCTGCAG 


AAATCTTTTT 


300 


TTAACTTTCG TATTCTGCTG 


CAGCTGAATA 


TGCAGATGAC AAAGATATGA 


CAACCATCAG 


360. 


CACTGAGGCC ACTGATGCTG 


GGGTCTGATG 


ATCACCTCGC AATAAATTGT 


TCGCAATATG 


420 


C 








421 



(2) INFORMATION FOR SEQ 10 NO.-45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 422 base pairs 
(8) TYPE: nucleic acid 
(C) STRANOEONESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:45: 



ACACACTGAG CTGCTACTCA 


CCAATCCTCC 


AGGTGACTCC 


CAGCCATGAA 


GACACTAATC 


60 


CTCCTCTCTG CCCTCGTCCT 


GCTGGCCTTC 


CAGGTCCAGG 


CTGATCCTAT 


CCAAAATACA 


120 


GATGAAGAGA CTAAAACTGA 


GGAGCAGCCA 


GGGGAAGAGG 


ACCAGGCCGT 


ATCTGTCTCC 


180 


TTTGGAGACC CAGAAGGCAC 


TTCTCTTCAA 


GAGGAATCAT 


TGAGATATCT 


GGTATGCTAT 


240 


TGTAGAGCAA GAGGCTGCAA 


AGGAAGAGAA CGCATGAATG GGACCTGCAG AAAGGGTCAT 


300 


TTATTGTACA TGCTCTGCTG 


TCGCTGAACA 


TGGAGACCTC 


AGAGAACAAG 


ACGACCATGA 


360 


GTAC7GAGGC CACTGATGCT 


GGTGCCTGAT 


GACCACTTCG 


CAATACATTG 


TTCGCAATAT 


420 


GC 










422 



<2) INFORMATION FOR SEQ 10 NO:46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 420 base pairs 
(8) TYPE: nucleic acid 
(C) STRANOEONESS: double 
(0) TOPOLOGY: I inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0/.46: 

ACACTGAGCT GCTACTCACC AATCCTCCAG G7GACTCCCA GCCATGAAGA CAC7AA7CC7 60 

CCTCTCTGCC CTCGTCCTGC TGGCCTTCCA GGTCCAGGCT GATCCTATCC AAAACACAGA 120 

TGAAGAGACT AAAACTGAGC AGCAGCCACG GGAAGACCAC CAGGCCGTAT CTQT CTCCTT 180 
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TGGAGACCCA GAAGGCTCTT CTCTTCAAGA GGAATCGTTG AGAGATCTGG TATGCTATTG 240 

TAGAACAAGA GGCTGCAAAA GAAGAGAACA CATGAATGGG ACCT GCAGAA AGGGTCATTT 300 

AATGTACACG CTCTGCTGTC GCTGAACATG GAGACCTCAG AGAACAAGAC GACCATGAGT 360 

ACTGAGGCCA CTGATGCTGG 7GCCTGATGA CCACTTCGCA ATAAATTCTT CGCAATATGC 420 

(2) INFORMATION FOR SEQ 10 NO:47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LEMGTH: 342 base pairs 
(8) TYPE: nucteic acid 
(C) STRANDEONESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:47: 

GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA GATGAAGAGA CTAAAACTGA 60 

GGAGCAGCCA GGGGAAGACG ACCACGCCGT ATCTGTCTCC TTTGGAGACC CAGAAGGCTC 120 

TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT TGTAGAAAAA GAGGCTGCAA 1S0 

AAGAAGAGAA CACATGAATG GGACCTGCAG AAAGGGTCAT TTAATGTACA CGCTCTGCTG 240 

TCCCTGAACA TGGAGACCAC AGAGGACAAG ACAAGCATGA GTACTGAGGC CACTGATGCT 300 

GGTGCCTGAT GACCACTTCG CAATAAATTG TTCGCAATAT GC 342 
(2) INFORMATION FOR SEQ ID NO:48:' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4a: 

ATGAAGACAC TAGTCCTCCT CTCTGCCCTC GTCCTGCTGG CCTTCCAGGT CCAGGCTGAT 60 

CCTATCCAAA ACACAGATGA AGAGACTAAA ACTGAGGAGC AGCCAGGGGA AGAGGACCAG 120 

GCCGTATCTG TCTCCTTTGG AGACCCAGAA GGCTCTTCTC TTCAAGAGGA ATCGTTGAGA 180 

GATCTGGTAT GCTATTGTAG AAAAAGAGGC TGCAAAAGAA GAGAACACAT GAATGGGACC 240 

TGCACAAAGG GTCATTTATT GTACATGCTC TGCTGTCGCT GAACATGGAG ACCACAGAGG 300 

ACAAGATGAA CATGAGTACT GAGGCCACTG ATGCTGGTGC CTGATGACCA CTTCGCAATA 360 

AATTGTTCGC AATATGC 377 

(2) INFORMATION FOR SEQ 10 NO:49: 

(t> SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 375 base pairs 
(8) TYPE: nucleic acid 
(C). STRANDEONESS: double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:49: 
GAAGACACTA GTCCTCCTCT CTGCCCTCGT CCTGCTGGCC TTCCAGGTCC AGGCTGATCC 60 
TATCCAAAAC ACAGATGAAG AGACTAAAAC TGAGGAGCAG CCAGGGGAAG ACGACCAGGC 120 
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CGTATCTCTC TCCTTTGGAC ACCCAGAAGG CTCTTCTCTT CAAGAGGAAT CGTTGAGAGA 180 

TCTGGTATGC TATTGTAGAA AAAGAGGCTG CAAAGGAAGA GAACGCATGA ATGGAACCTG 240 

CAGAAAGGGT CATTTATTGT ACACGCTCTG CTCTCGCTGA ACATGGAGAC CACAGAGGAC 300 

AAGACGAACA TGAGTACTGA GGCCACTGAT GCTGGTGCCT GATGACCACT TCGCAATAAA 360 

TTGTTCGCAA TATGC 373 

(2) INFORMATION FOR SEQ 10 NO:50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 352 base pairs 
<B3 TYPE: nucleic acid 
CO STRANDEDNESS; double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:50: 

CCCTCGTCCT GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA GATGAAGAGA 60 

CTAAAACTGA CGAGCAGCCA GGGGAAGAGG ACCAGGCCGT ATCTGTCTCC TTTGGAGACC 120 

CAGAAGGCAC TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT TGTAGATCAA 180 

GAGGCTGCAA AGGAAGAGAA CGCATCAATG GAACCTGCAG AAAGGGTCAT TTATTGTACA 240 

TGCTCTGCTG TCGCTGAACA TGGAGACCAC AGAGAACAAG ACGACCATGA GTACTGAGGC 300 

CACTGATGCT GGTGCCTGAT GACCACTTCG CAATACATTG TTCGCAATAT GC 352 

(2) INFORMATION FOR SEQ 10 NO:5l: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 422 base pairs 
<B> TYPE: nucleic acid 
(O STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

ACACATTGGG CTCCTGCTCA CCAATCCTCC AGGTGACTCC CAGCCATGAA GACACTAGTC 60 

CTCCTCTCTG CCCTCGTCCT GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA 120 

GATGAAGAGA CTAAAACTGA CGAGCAGCCA GGGGAAGAGG ACCAGGCCGT ATCTGTCTCC 180 

TTTGGAGACC CAGAAGGCAC TTCTCTCCAA GAGGAATACT TGAGAGATCT GGTATGCTAT 240 

TGTAGAGCAA GAGGCTGCAA AGGAAGAGAA CGCATGAATG GGACCTGCAG AAAGGGTCAT 300 

TTAATGTACA CGCTCTGCTG TCGCTGAACA TGGAGACCTC AGAGAACAAG ACGACCATGA 360 

GTACTGAGGC CACTGATGCT GGTGCCTGAT GACCACTTCG CAATAAATTG TTCGCAATAT 420 

GC 422 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
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GACTCCCAGC CATGAAGACA CTAGTCCTCC TCTCTGCCCT TGTCCTGCTG GCCTTCCAGG 60 

TCCAGGCTGA TCCTATCCAA AACACAGATG AAGAGACTAA AACTGAGGAG CAGCCAGGAG 120 

AAGAGGACCA GGCCGTATCT GTCTCCTTTG GAGACCCAGA AGGCACTTCT CTTCAAGAGG 180 

AATCGTTGAG AGATCTGGTA TGCTATTGTA GAAAAAGAGG CTGCAAAAGA AGAGAACACA 240 

TGAATGGGAC CTGCAGAAGG GGTCATT-TAA TGTACACACT CTGCTGTCGC TGAACATGGA 300 

GACCACAGAG GACAAGACGA ACA7GAGTAC TGAGGCCACT GATGCTGGTG CCTGATGACC .360 

ACCTCGCAAT AAATTGTTCG CAATATGC 3S8 

(2) INFORMATION FOR SEC ID NO:53: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 352 base pairs 
(8) TYPE: nucleic acid ' 
(C) STRANOEDNESS: double 
(0) TOPOLOGY: linear 

<*i) SEQUENCE DESCRIPTION: SEQ IQ NO:53: 

CCCTCGTCCT GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA GATGAAGAGA 60 

CTAAAACTGA GGAGCAGCCA GGGGAAGAGG ACCAGGCCGT ATCTGTCTCC TTTGGAGACC 120 

CAGAAGGCTC TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT TGTAGAACAA 180 

GAGGCTGCAA AAGAAGAGAA CGCATGAATG GGACCTGCAG AAAGGGTCAT TTAATGCACA 240 

CGCTCTGCTG TCGCTGAACA TGGAGACCAC AGAGGACAAG ACGAGCATGA GTACTGAGGC 300 

CACTGATGCT GGTGCCTGAT GACCACTTCG CAATAAATTG TTCGCAAAAT GC . 352 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 401 base pairs 

(B) TYPE : nucleic acid 
CO STRANOEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-S4: 

CCAATCCTCC CAGTGACTCC CAGCCATGAA GACACTAGTC CTCCTCTCTG CCCTTGTCCT 60 

GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAACACA GATGAAGAGA CTAAAACTGA 120 

GGAGCAGCCA GGGGAAGACG ACCAGGCCGT ATCTGTCTCC TTTGGAGACC CAGAAGGCTC 180 

TTCTCTTCAA GAGGAATCGT TGAGAGATCT GGTATGCTAT TGTAGAAAAA GAGGCTGCAA 240 

AAGAAGAGAA CACATAAATG GGACCTGCAG AAAGGGTCAT TTATTGTACA CTCTCTGCTG 300 

TCGCTGAACA TGGAGACCAC AGAGGACAAG ATGACCATGA GTACTGAGGC CACTGATGCT 360 

GGTGCCTGAT GACCACTCGC AATAAATTGT TCGCAATATG C 4Q1 



(2) INFORMATION FOR SEQ 10 N0:55: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 391 base pairs 
C B ) TYPE : nucleic acid 
(C) STRANOEDNESS: double 
(0) TOPOLOGY: linear 
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<xi) SZQUSHCZ DESCRIPTION: SEQ 10 NO:55: 

GGTGACTCCC AGCCATGAAG ACACTAATCC TCCTCTCTGC CCTCGTCCTG CTGGCCTTCC 60 

AGGTCCAGGC TGATCCTATC CAAAACACAG ATGAAGAGAC TAAAACTGAG GAGCAGCCAG 120 

GAGAAGAGGA CCAGGCCGTA TCTGTCTCCT TTQQAQACCC AGAAGGCACT TCTCTTCAAG 180 

AGGAATCGTT GAGAGATCTG GTATGCTATT GTAGATCAAG AGGCTGCAAA GGAAGAGAAC 240 

GCATGAATGG GACCTGCAGA AAGGGTCATT TAATGTACAC GCTCTGCTGT CGCTGAACAT 300 

i 

GGAGACCTCA GAGAACAAGA CGACCATGAG TACTCAGGCC ACTGATCCTG GTGCCTGA7G . 360 

ACCACTTCGC AATAAATTGT TCGCAATATG C 391 

(2) IN FORMAT ION FOR SEQ 10 NO:56: 

(t) SEQUEMCS CHARACTER 1ST ICS: 
(A) LENGTH: 342 base pairs 
(8) TYPE : nucleic acid 
CO STRAN0E0NESS: double . 
(0) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ 10 NQ:56: 

GCTGGCCTTC CAGGTCCAGG CTGATCCTAT CCAAAATACA GATGAAGAGA CTAAAACTGA 60 

GGAGCAGCCA GGAGAAGAGG ACCAGGCCGT ATCTGTCTCC TTTGGAGACC CAGAAGGCAC 120 _ 

TTCTCTTCAA GAGGAATCGT TGAGACATCT GGTATGCTAT TGTAGAAAAA GAGGCTGCAA 180 

AAGAAGAGAA CACATGAATG GGACCTGCAG AAAGGGTCAT TTATTGTACA CGCTCTGCTG 240 • 

TCGCTGAACA TGGAGACCAC AGAGGACAAG ATGACCATGA GTACTGAGGC CACTGATGCT 300 

GGTGCCTGAT GACCACCTCG CAATAAATTG CTTGCAATAT GC 342 

(2) INFORMATION FOR SEQ ID NO:57: 

<i> SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 403 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEONESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:57: 

ACACATGGCT CTCTCACCAA TCCTCCAGGT GACTCCCAGC CATGAAGACA CTAGYCCTCC 60 

TCTCTGCCTG TCCTGCTGGC CTTCCA'GGTC CAGGCTGATC CTATCCAAAA CACAGATGAA 120 

GAGACTAAAA CTGAGGACCA GCCAGGGGAA GAGACCAGGC TGTGTCTGTC TCTTTTGGAG 180 

ACCCAGAAGG CCTTCTCTTC AAGAGGAATC GTTGAGAGAT CTGGTATGCT ATTGTAGAAA 240 

GAGGCTGCAA AGAAGAGAAC CATGAATGGG ACCTGCACAA AGGGTCATTT ATGTACAGCT 300 

CTGCTGTCGC TGAACATGGA GACCCAGAGA CAAGAACATG 4GTACTGAGG CCACTGATGC 360 

TGGTGCCTGA TGACCACTTC TCAATAAATT GTTCGCAATA TGC 403 

(2) INFORMATION FOR SEQ 10 NC:58: 

(i) SEQUENCE CHARACTERISTICS: 
(A)- LENGTH: 419 base pairs 
(8) TYPE: nucleic acid 
<C) STRANOEONESS: single 
(0) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME/KEY: misc feature 
(3) LOCATION: 279.7286 

<0) OTHER INFORMATION: /notes «N represents DNA chat was 
not sequenced." 



(xi) SEQUENCE' DESCRIPTION: SEQ ID N0:58: 



TATAAATCCA 


GGCTGGATAT TCACTGTCCA 


CACATTGAGC 


TCCTGCTCAC 


CAATCCTCCA 


60 


GCTCACTCCC 


AGCCATGAAG ACACTAGTCC 


TCCTCTCTGC 


CCTTGTCCTG 


CTGGCCTTC: 


120 


AGGTCCAGGC 


TGATCCTATC CAAAACACAG 


ATGAAGAGAC 


TAAAACTGAG 


GAGCAGCCAG 


180 


GAGAAGAGGA 


CCAGGCCGTA TCTGTCTCCT TTGGAGACCC 


AGAAGGCACT 


TCTCTTCAAG 


240 


AGGAATGTGA 


GTACTGGTGT CCAGAGTGAT 


GGATGCTTNN 


NNNNNNTTTT 


GTATCTCCAG 


300 


CGTTGAGAGA 


TCTGGTATGC TATTGTAGAT 


CAAGAGGCTG 


CAAAGGAAGA 


GAACGCATGA 


360 


ATGCAACCTG 


CAGAAAGGGT CATTTATTGT 


ACACGCTCTG 


CTGTCGCTGA 


ACATGGAGA 


419 



(2) INFORMATION FOR SEQ 10 N0:S9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 419 base pairs 
(8) TYPE: nucleic acid 
(C) STRAND EONESS : single 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 
(8) LOCATION: 279.7286 

(D) OTHER INFORMATION: /note= "N represents ONA that was 
not sequenced." 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:59: 

TATAAATCCA GACTGGCTCC TCACTCTCCA CACATTGGGC TCCTGCTCAC CAATCCTCCC 60 

AGTGACTCCC AGCCATGAAG CCACTTGTCC TCCTTTCTGC CCTTGTCCTA CTGTCCTTTC 120 

AGGTCCAGGC TGATCCTATC CAAAACACAG ATGAAGAGAC TAAAACTGAG GAGCAGTCAG 180 

GTGAAGAGGA CCAGGCTGTG TCTGTCTCCT TTGGAGACCC AGAAGGCGCT TCTCTTCAAG 240 

AAGAATGTGA GTACTGGTGC CCAGTGTGAT GGAJGCTTNN NNNNNNTTTT GTGTCTCCAG 300 

. CGTTGAGAGA TCTGGTATGC TATTGTAGAA CAAGAGGTTG CAAAAGAAGA GAACGCATGA 360 

ATGGGACCTG CAGAAAGGGT CATTTAATGT ACACGCTCTG GTGCCGCTGA ACATGGAGA 419 

(2) INFORMATION FOR SEQ ID M0:60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 419 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 
(3 ) LOCATION: 279.72B6 

(0) OTHER INFORMATION: /noce* 11 N represents ONA chat was 
not sequenced. 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:60: 
TATAAATCCA GGCTGGATAT TCACfCTCCA CACATTGGGC TCCTGCTCAC CAATCCTCCA 60 
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CCTCACTCCC AGCCATGAAG 


ACACTAGTCC 


TCCTCTCTGC 


CCTCGTCCTG CTGGCCTTCC 


120 


AGGTCCAGGC TGATCCTATC 


CAAAACACAG 


ATGAAGAGAC 


TAAAACTGAG GAGCAGCCAG 


180 


GGGAAGACGA CCAGGCTGTG 


TCTGTCTCTT 


TTGGAGACCC 


AGAAGGCTCT TCTCTTCAAG 


240 


AGGAATGTGA GTATTGGTGT 


CC1MGTGAT 


GGATGCTTNN 


HHHNHMTn GTGTCTCCAG 


300 


CGTTGAGAGA TCTGGTATGC 


TATTGTAGAA 


AAAGAGGCTG 


CAAAAGAAGA GAACGCATGA 


360 


ATGGGACCTG CAGAAAGGGT 


CATTTAATGT 


ACACACTCTC 


CTGTCGCTGA ACATGGAGA 


419 



<2> IN FORMAT ION FOR S£Q 10 M0:6J: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEQHESS: single 
(0) TOPOLOGY: t inear 



< ix) FEATURE: 

(A) NAME/KEY: misc feature 
(9) LOCATION: 279.7286 

CO) OTHER INFORMATION: /note* "N represents ONA that was 
not sequenced." 



(xi) SEQUENCE OESCRIPTION: SEQ 10 N0:61: 



TATAAATGCA 


AGTTGGCTAC 


TCACTCTCCA 


CACATTGGGC 


TCCTGCTCAC 


CAATTCTCCA 


60 


GGTGACCCCC 


AGCCATGAAG 


ACATTTGTCC 


TCCTCTCTGC 


CCTTGTCCTG 


CTGGCCTTCC 


120 


AGGTCCAGGC 


TGATCCTATC 


CACAAAACAG ATGAAGAGAC 


TAATACTGAG 


GAGCAGCCAG 


ISO 


GGGAAGAGGA 


CCAGGCTCTC 


TCAGTCTCCT 


TTGGAGGCCA 


AGAAGGCTCT 


GCTCTTCATG 


240 


AAGAATGTGA 


GTAGTGGTAC 


GCAGTGTGAT 


GGATGCTTNN 


NNNNNNTTTT 


GTGTCTCCAG 


300 


TGTCAAAAAA 


GCTGATATGC 


TATTGTAGAA 


TAAGAGGCTG 


CAAAAGAAGA 


GAACGCGTTT 


360 


TTGGGACCTG 


CAGAAATCTT 


TTTTTAACTT 


TCGTATTCTG 


CTGTAGCTGA 


ATATGCAGA 


419 



(2) INFORMATION FOR SEQ 10 NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 279.7286 

(0) OTHER INFORMATION: /note= "N represents ONA that was 
not sequenced." 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NQ:62: 



TATAAATGCA 


CGCTGGATAT 


TCACTCTCCA 


CACACTGAGC TGCTACTCAC 


CAATCCTCCA 


60 


GGTGACTCCC 


AGCCATGAAG 


ACACTAATCC 


TCCTCTCTGC CCTCGTCCTG 


CTGGCCTTCC 


120 


AGGTCCAGGC 


TGATCCTATC 


CAAAATACAG 


ATGAAGAGAC TAAAACTGAG 


GAGCAGCCAG 


180 


GGGAAGAGGA 


CCAGGCTGTG 


TCTGTCTCTT 


TTGGAGACCC AGAAGGCACT 


TCTCTTCAAG 


240 


AGGAATGTGA 


GTACTGGTGT 


CCAGTGTGAT 


GGATGCTTNN NNNNNNTTTT 


GTGTCTCCAG 


300 


CATTGAGAGA 


TCTGGTATGC 


TATTCTAGAG 


CAACAGCCTG CAAAGGAAGA 


GAACGCATGA 


360 
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ATCGGACCTG CAGAAAGGGT CATTTATTGT ACATGCTCTG CTGTCGCTGA ACATGGAGA 419 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 419 base pairs 
(8) TYPE: nucleic acid 
(C) STRANOEONESS: single 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 
CB) LOCATION: 279.7286 

(0) OTHER INFORMATION: /note= "H represents ONA that was. 
not sequenced." 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO:63: 

ACTTGAGGGT AACAGCCTCT CCCAATTCCA CACATTGAGC TCCTGCTCAC CAATCCTCCA 60 

GGTGACTCCC ACCCATGAAG ACACTAGTCC TCCTCTCTGC CCTTGCCCTG CTGGCCTTCC 120 

AAGTCCAGGC TGATCCTATC CAAAACACAG ATGAAGAGAC TAAAACTGAG GAGCAGCCAG 180 

GGAAAGAAGA CCAAGCTGTT TCTGTCTCCT TTGGAGACCC AGAAGGCTCT TCTCTTCAAG 240 

AGGAATGTGA GTACTGGTGC CCAGTGTGAT GGATGCTTNN NNNNNNTTTT GTGTCTCCAG . 3G0 

CGTTGAGAGA TCTGATATGA TATTGTAGAA CAACAGGCTG CAAAAGAAGA GAACGCCTGA 360 

ATGGCACCTG AAGAAAGGGT CATTTATTGT ACATGCTCTG CTGCTGCTGA ACATGGAGA 419 

(2) INFORMATION FOR SEQ 10 NO:64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 411 base pairs 
(8 } TYPE: nucleic acid 
<C) STRANOEONESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64; 

TATAAATGCA RRCTGGMTMY TCACTNTCCA CACATTGRGC TCCTGCTCAC CAATCCTCCA 60 

GGTGACTCCC AGCCATGAAG ACACTUGTCC TCCTCTCTGC CCTYGTCCTG CTGGCCTTCC 120 

AGGTCCAGGC TGATCCTATC CAAAAHACAG ATGAAGAGAC TAAAACTGAG GAGCAGCCAG 180 

GOGAAGARGA . CCAGGCTGTO TCTGTCTCYT TTGGAGACCV AGAAGGCOCT TCTCTTCAAG 240 

ARGAATGTGA GTASTCGTGY CCAGTGTGAT GGATGCTTTT TTGTGTCTCC AGCGTTGAGA 300 

GATCTGRTAT GCTATTGTAG AOHAAGACGC TGCAAARGAA GAGAACGCVT GAATGGGACC 360 

TGCAGAAAGG GTCATTTAUT GTACANNCTC TGCTGYRGCT GAACATGGAG A 411 

(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 445 base pairs 
(S) TYPE: nucleic acid 
(C) STRANOEONESS: double 
(0) TOPOLOGY: linear 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO:65: 
ACACTGGTCT CCACCTCACC AATCCTCCAG GTGACUCCA GCCATGAAGA CTCTTGTCCT 60 
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CCTCTCTGCC CTTGTCCTGC TGGCATTCCA GGTCCAGGCT GATCCCATTC AAGAGGCAGA 120 

AGAAGACACT AAAACTGAGG AGCAGCCAGC ACATGAGGAC CAGGATGTGT CTGTCTCCTT 130 

TGAAGGCCCA GAACCCTCTG CTCTTCAAAA TTTAGAGA TA GGATGGCCAT TAAAGCAGTG 240 

CCAf TGCCGA AAGTTCTGCA GACCTTATGA AAAGGCCGAG GGGTCCTGTC GTCCAGGTCT 300 

ATTTATAAAA CGCAAAATCT GCTGCATACA ACAATGGACA CCAGGGAGGA CATAACCACG 360 

TGAACTGGGA CCTCACAATC TGTCATTCTT GCGCTrCAAC TCGACTGCTT TTCCTTCTCC 420 

AATAAACCCC TTGCAGACAA AAAAA W 
(2) IN FORMAT IOM FOR SEQ 10 N0:66: 

(i) sequence character tsncs: 

(A) LENGTH: 445 base pairs 

(8) TYPE: nucleic acid 

CO S7RAN0E0NESS: double 

(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:66: 

ACACTGGTCT CCAGCTCACC AATCCTCCAG GTGACTTCCA GCCATGAAGA CTCTTGTCCT 60 

CCTCTCTGCC CTTGTCCTGG TGGCCTACCA GGTCCAGGCT GATCCCATTC AAGCGGCAGA 120 ' 

AGAAGAGACT AAAACTGAAG AGCAACCATC AGATGAGGAC CAGGATGTGT CTGTCTCCTT 180 

TGAAGGCCCA GAAGCCTCTG CTCTTCAAGA TTTTGAGATA GGAAGGCCAG TGAGGAGGTG 240 

CCGTTGCACA CCAAACTGCG GACCTAAAGA ATATGCCACT GCGTTCTGTG CTCAAGGTCC 300 

ATTTAAACAG TTCAAATTCT GCTGCACATG AACATGGATC CCAAGTCTGA GATAACCACG 360 

TGCTCTGGGA CCTCACAATC TGTCATTATT CTGCTTGACG TCAACTGCTT TTCCTTCTCC 420 

AATAAACTCC TGGCAGACAA AAAAA 445 

(2) INFORMATION FOR SEQ ID NO:67: 

(U SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 445 base pairs 
(8) TYPE : nucleic acid 
CO STRANOEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SCO ID N0:67: 

ACACTGGTCT CCAGCTCACC AATCCTCCAG GTGACTTCCA GCCATGAAGA CTCTTGTCCT 60 

CCTCTCTGCC CTTGTCCTGC TGGCATTCCA GATCCAGGCT GATCCCATTC AAGAGGCAGA 120 

AGAAGAGACT AAAACTGAGG AGCAGCCAGC AGATGAGGAC CAGGATGTGT CTGTCTCCTT ISO 

TGAAGGCCCA GAACCCTCTG CTCTTCAAAA TTTAGAGATC AGATGGCCAT GGAAGAGGTG 24Q 

CCATTGCAGA AGTTTCTGCA GACCTTATGA AAATGCCACT TCGTTCTGTG CTCAAGGTCT 300 

ATTTAAACAA CACAAATTCT GCTGCCTAGA AACATGGCCC CCAAGGATGA AAT.AACCACG 360 

TGCTCTGGGA CCTCACAATC TGTCArCAfT GTGCTTGCCC TCAACTTCTT TTCCTTCTCC 420 

AATAAACTCC TTGCAGACAA AAAAA W 
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(2>. INFORMATION FOR SEQ 10 N0:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2457 base pairs 
£ B ) TYPE: nucleic acid 
(C) STRAND EONESS : double 
(0) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEO 10 NQ:6fl: 



CCTGAGACCA 


ACTCTGTGAT 


AATCAGAAAA 


GTCAATAATG 


TGTCTGAAAT GTAAGGTGTG 


60 


CTTCTTGACT 


GATAGTTCTA 


AGCCTACAGA 


GAGATTCATG 


TGGTCATATC CCATTTAACA 


120 


ATGATATATA 


TGTTAAATAT 


ATAAAGATAT 


ATGTATCTTC 


AGTATGTATG TTCAATATGT 


180 


ATGTAAATAA 


TATTCTTGCT 


GCTTCACTAG 


rTTTTArATA 


r-APPTftT AAf! TAAAAAPATT 
Unub 1 U 1 AAU lAAAAALnl 1 






AATAGTATTT 


AT TAACATKT 
n i i nn \__n i vj I 


aaa t xc.cxr.r 

nAn 1 KUuMuL 


\ uuLALL • u 1 uALAu 1 uuun 


300 


CTCCATACAG 


TGACTGTAAA 

1 Unv* 1 u * n/\n 


CAACAGGATG 


LI L i UUALL 1 


1 i 1 UL lul Ul Ullll UU 1 UAl) 


360 


kc.aca Tr.nr.A 

n ijn v. n I uvjun 


TAAACACAGA 


rTfiiicirrr. 

I. I UnniVlAU 1 \j 


1 1 • LAA i u 


APATrr.rr.nr APTTPTrr.Ar. 

ALAILULuuL AL 1 1 L J LuAU 






AGCTTCTGAG 
nub * i v. i uau 


rrTCTCTAPA 
1>L iwlLI ALA 


1 I Li 1 wUA 1 (j J 


LL 1 1 1 LL I u 1 AbU 1 LAUu I L 




T PA TTTTCTi 


Ann U 1 AAAAL 


LA 1 1 LLALLA 


1 L ILAUALLT 


GGGAAAlALL LLATGGliKj 




ALUU I LL 1 LA 


ULA 1 UAAUAu 


LLALL 1 LLAG 


CTlAlTCTTG 


GCAGATGTGT TClATGAlTT 


600 


TGGCTTCTTC 


AGAACAACCC 


ACTACAGCTT 


CACTCTGACA 


aatcctagaa ACTTGAACTC 


660 


AATTCACTAG 


*rf*rr »ri*i t 

AGGGCACCAT 


AAAGCCATCA 


TACCTTATAA 


TGGCCCCAAA GGAGGTGATT 


770 


CACAAAGTTT 


GCCTTGATGA 


GGACAATTGC 


TAATACACAA 


AAACTTGCAA AAAAAAATTG 


780 


AGTGTCCAGT 


CCACCTGGTC 


AAGGACTGGT . 


CCCGGATCCA 


CAGTTTCTGA GAATAGCAGG 


840 


CTCTAACTTG 


AAAACACAAA 


AATTGTTTGT 


TCTATGAGCT 


CATTAAATTA GGCAGTGTTC 


900 


A r* T 4 T T T T 

AGCTATTTTC 


TTTCCTGACC 


ACTGAGAGGT 


AAATACTCAA 


GCAGATGGGA AACAGGGGAG 


960 


GACAGTAAAG 


CCTGTTCATC 


ATTATCAGTG 


GGAGTGTGCA 


TGAGGGGAGG GGTGTCAGTG 


1020 


AACACACAGA 


GCATCAGGAA 


GGAAGCCTTG 


AGGACAGAGG 


AACATCAAAG GGATCCTGAG 


1080 


GACAACAGCT 


GGGAGCAGTT 


GCCATCAATG 


AGTGCCTTCT 


CTAAGTATGG GGCATGTTCT 


1140 


TTGCCCTATA 


AATGCAGGCT 


GGCTTCTCTC 


TCCACACAC7 


GGTCTCCAGC TCACCAATCC 


1200 


TCCAGGTGAC 


TTCCAGCCAT 


GAAGACTCTT 


GTCCTCCTCT 


CTGCCCTTGT CCTGCTGGCA 


1260 


TTCCAGGTCC 


AGGCTGATCC 


CATTCAAGAG 


GCAGAAGAAG 


AGACTAAAAC TGAGGAGCAG 


1320 


CCAGCAGATG 


AGGACCAGGA 


TGTGTCTGTC 


TCCTTTGAAG 


GCCCAGAACC CTCTGCTCTT 


1380 


CAAAATTTAG 


GTGCGTGCTT 


GTGCACAGAA 


TGATGGAGGC 


TTGGAGTCTC CTCATGGAGG 


14A0- 


GTTGTAGATT 


AGCCCTGGAG 


TCCTGTCAAG 


GACAGTCTGG 


TTCAGGTAGC TGTCTACTGA 


1500 


TCCTT TCAGA 


ACTTCCCTGT 


CTTATTCATA' 


GAAATAACAG 


TGAGAGACAA GCCATTGGGC 


1560 


TTGACTTTTT 


CCTTTTAAGA 


TTTCGGTCTA 


ACAATTTATC 


TGTGAAAAAC CTTTAAAATA 


1620 


TAAAACATAT 


TGATTAGTTC 


TTTAAACCTG 


AGTGATAATT 


TTCTTACAGG AAGAAATATC 


1680 


CGTTT TACCC 


TAAAAATTAG 


ATTGGTACCC 


AAATGCCAGT 


GTATGAAGGT GTTGGGTCAA 


1740 


GAAAACACAA 


AAAAACTGTT 


AGAATATGGT 


CTACATCaAA 


AfTCCTATAT GTGATTAACA 


1800 


CTTGTTAAAC 


ATCTrATCTC 


CATGTGTTTG 


GGGrTGATCA 


CTGTCCTGCC TGTGATGTCA 


1860 
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CCCACACAGC AAACCTACTC TCTACCATGC ACAGGACATC TTCATGGGGT AGTTCACTGT 192Q 

TACACACTAC TGCCCTCCTT ACTTCATGCC TGAf GCTTTC TTGTTTCCTC AGAGATAGGA 1980 

TGGCCATTAA AGCAGTGCCA TTGCCGAAAG TTCTGCAGAC CTTATGAAAA GGCCGAGGGG 2040 

TCCTGTCGTC CAGGTCTATT TATAAAACGC AAAATCTGCT GCATACAACA ATGGACACCA 2100 

GGGAGGACAT AACCACGTGA ACTGGGACCT CACAATCTGT CATTCTTGGG CTTCAACTCG 2160 

ACTGCTTTTC CTTCTCCAAT AAACCCCTTG CAGACAAATA ACCTGTTTAT GTTTTTTTGA 2220 

TGCTTTCTAT GTGGCGTAGA CAGGACTCTC CTGAGCCATG TAGCAAAATC TTCACTGAAf 2260 

CCT7TGTAAA AGAAGTCTTG GTCACA777C AGCAGTCA7A TCAAGGATGA GCAGGAGGTT 2340 

AGATCCAAAG AGACAAGATG GTCTGCGCCA GCTGCTTCTC TGTCTATCAA GTCTTCTGTC 2400 

CTTTAGATTA GAGTCACCCT CAAAAATTAG TTCCAGATTT TCATGTTCTA TTTTTTC 2457 
(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE OESCRtPftON: SEQ 10 NO: 69: 



TATTACGAAT 


TCGAGCTCGG 


7ACCGGTATA 


TGAAGAGCGA CCACTGCCAG GACGAAAG7G 


60 


CAATGCGGCA 


TACCTCAGTG 


GCGTGGAGTG 


CAGGTATACA GATTAATCCG GCAGCGTCCG 


120 


TCGTTGTTGA 


TATTGCTTAT 


GAAGGCTCCG 


GCAGTGGCGA CTGGCGTACT GACGGATTCA 


lao 


TCGTTGGGGT 


CGGTTATAAA 


TTCTGATTAG 


CCAGGTAACA CAGTGTTATG ACAGCCCGCC 


240 


GGAACCGGTG 


GGCTTTTTTG 


TGGGGTGAAT 


ATGGCAGTAA AGATTTCAGG AGTCCTGAAA 


IQO 


GACGGCACAG 


GAAAACCGGT 


ACAGAACTGC 


ACCATTCAGC TGAAAGCCAG ACGTAACAGC 


360 


ACCACGGTGG 


TGGTGAACAC 


GGTGGGCTCA 


GAGAATCCGG ATGAAGCCTG CTTTTTTATA 


420 


CTAAGTTGGC 


ATTATAAAAA 


AGCATTGCTT 


ATCAATTTGT TGCAACGAAC AGGTCACTAT 


430 


CAGTCAAAAT 


AAAA7CAT7A 


777GA7T7CA 


A77T7G7CCC ACTCCCTGCC 7C7G7CA7CA 


540 


CGATACTGTG 


ATGCCATGGT 


GTCCGACTTA 


TGCCCGAGAA GATGTTGAGC AAACTTATCG 


600 


CTTATCTGCT 


TCTCATAGAG 


TCTTGCAGAC 


AAACTGCGCA ACTCGTGAAA GGTAGGCGGA 


660 


TCTGGGTCGA 


CTCTAGGCCT 


CACTGGCCTA 


ATACGACTCA CTATAGGGAG CTCGAGGATC 


720 


ATTGCTAATA 


CCATGAAACT 


TGACCACCTG 


GTCAAGGACT GGTCCAGGCT CCACAGTTTC 


780 


TGAGAAGAGC 


AGGCTCCAAC 


TTCTAACCAC 


AAAAACTATT TTTTCCATGC GCTCCTTAAA 


840 


TTAGGCAGCG 


CCCAGCTATT 


TTCTTTCCTG 


ACCACTGAGA GGTAAATACT CAAGCAGATG 


900 


GGAAACAGGG 


GAAGATAGCA 


AGGCCTCTTC 


ATCATTATCA CTGGGTGTGT GCGTGAGGGG 


960 


AGGGG7G7CA 


77GCA7ACAC 


AGGGCAACA7 


CAGGATGCAA GCCT7GAGGA CAGAGGAACA 


1020 


TCAAAGGGAT 


CCTGAGGACA 


ACAGCTGGGA 


GCAGTTGCCA TCAGTGAGTG CCTTCTCTAA 


1Q80 


GTGTGGGGCC 


TTTCTCTGCC 


ACATAAATGC 


AGGCTGCCTC CTCTCTCCAC ACACTGGTCT 


1140 


CCAGCTCACC 


AA7CCTCCAG 


GTGACTTCCA 


GCCATCAAGA CTCTTGTCCT CCTCTCTGCC 


1200 


CTTGTCCTGG 


TGGCCTACCA 


GGTCCAGGCT 


GATCCCATTC AAGGGGCAGA AGAAGAGACT 


1260 
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AAAACTGAAG AGCAACCA7C AGATGAGGAC CAGGATGTGT CTGTCTCCTT TGAAGGCCCA 1320 

GAAGCCTCTG CTCTTCAAGA TTTTGGTGAG TGCTTATGCA CAGAATGATG GAGGCTTGGA 1380 

GTCTCCTGAT GGAGGGTTGT AGATTAGACC TGGAATCCTG TCAAGAACTG TCTGGTTCAG 1W0 

GTAGCTGTCT CTTGGTCCCT TTACATTCCT TGTCTTCTTC ATAGAACTAA CGGAGAGAGA 1500 

TTAACCATTG GGCTTGACT7 TTTTCCTTTT AAAATTTTTG ATCTAACAAT TTATCTGTGG 1S60 

AAAACCTTTA AAATATAAAA CATATTGATT AGTTCTTTTA GACCTGATTG ATAATTTTGT 1620 

TATAAGAAGA AATAfTCGTT CTACTTTAAA AATTAGATTT GGGACCCAAA TGCCAGTGTA 1680 

TGAAGCTGTT GGGTAAGGAA AAACCAAAAA TGGTGATAGA ATGTTGTGTA GATGACAATT 1740 

CCTTTATGCG ATTAACAC77 TTTAAAATGT CTTATCTCCA TGTG777GGG GTTGATCATG 1800 

GTGCTGACTG TGATGTCACC CACAGAGCAA ACCTACTCTC TACCATGCAC AGGACATCTT 1860 

CATAGGGTAG TTCACTGTCA CACACTGCTG GCCTCGTTAC TTCATGCCTG ATGCTTTCTT 1920 

GTTTCCTCAG AGATAGGAAG GCCAGTGAGG AGGTGCCGTT GCAGAGCAAA CTGCGGACCT 1980 

AAAGAATATG CCACTGCGTT CTGTGCTCAA GGTCCATTTA AACAGTTCAA ATTCTGCTGC 2040 

ACATGAACAT GGATCCCAAG TCTGAGATAA CCACGTGCTC TGGGACCTCA CAATCTGTCA 2100 

TTATTGTGCT TGACCTCAAC TGCTTTTCCT TCTCCAATAA ACTCC7GGCA GACAAATAAT 2160 

CGGTATATGT TTATTTGATG CTTTCTATTT GGCTTAGACA GAACTCTCCT GAGCCATGTA' 2220 

GCTGAATCTT CAGTGAATCC TTTGTAAAGC TCACA7TTCA GCAGTCATAT CAAGGATGAG 2280 

CAGGAGGTTA GATACAAAGA GACAAGATGG TCTGCGCCAG CTGCTTCTTT GTCTATCAAG 2340 

TCTGCTTTCC TTTAGATTAG AGTCACCATC AAAAATTATT CCCACATTTT CATGT7CTAT 2400 

A777T77T 2408 

(2) INFORMATION FOR SEQ 10 NO:70: 

(i) SEQUENCE CHARACTER I S7 ICS: 

(A) LENGTH ; 2551 base pairs 
CB) TYPE: nucleic acid 
(C) S7RAN0EONESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO 10 N0:70: 

CCTGAGACCA ACTCTGTGAT AATCAGAAAA GACAATTATG TGTCTTAAAT GTAAGGTTTG 60 

CTTCT7GAC7 GATAGATCTA ACCCTACAGA GAGATTCAAG 7GG7CT7G7C CCATTGAACA 120 

A7AGTATATA TGTTTTATAT ATATATATAT ATATATGTAT ATGTATATAT ATATGTGTGT . 180 

GTGTGTGTGT GTGTGTCTG7 GTC7G7G7GT C7G7G7GTC7 GTG7GTCTGT GTGTCTGTGT 240 

GTGTATGTGT GTGTA7G7G7 ACA7ATGTTC AATATGTCTG TAAAATAGTA TTCTTGTAGC 300 

TTCACTTACT TTTGCACAGA GCTGTAAATA ACAACATTGT AGCCAATGAA TAGTATTTAT 360 

TAACATGTAA ATAGGAGCTG GCACCTCTGA CAGTGGGACT CCATACAGTG ACTGTAAACA 42Q 

ACACGATGCT CTAGACCTTT TGCTGTGTGT GTGGTGAGAG ACATGGGATA AACACAGACT 480 

GAAGTGTATG ACATGGCGGC ACTTCTCGAG ACCGGGTAGC AGCTTCTGAC CCTCTCTACA 540 

T TGTGGA TGT CCTTTCCTGT AGGTCAGGTC TCATTGTCTA AAAGTAAAAG CATTGCAGCA 600 

TC7CAGACCT CGCAAACACC CCATGGCTTG AGCCTCCCGC AGGTGAAGAG CCACCTGGAG 660 
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CTCACTC77C 


GCAGA7G7G7 


7CCA7GAC77 7GGC77CT7C AGAACCACCC AC7ACAGC7T 


720 


CACTCTGACA 


AATCTTAGAA 


ACTTGAACTC AATTCACTGG AGGGCACAAT AAAGCCA7CT 


rao 


TACTTTCTCT 


AAAATGGCCC 


CAAAGGAGGG GATTCACAAA G7TTGCCTTG ATGAGGACCA 


840 


TTGCTAATAC 


CCCAAAACTT 


GCAAAAAAAA TTGAGTGTCC AGTCAACCTG GTCAAGGACT 


900 


CGTCCTCGAT 


CCACAGTTTC 


TGAGAAAAGA AGGC7CCAAC 7TCAAAACAC AAACCAC7CC 


960 


TCTTCTATCC 


GCTCATTAAA 


7TAGGCAGTG TTAAGCTATT 77CTTTCC7G ACCACTGAGA 


1O20 


GGTAAATACT 


CAAGCAGATG 


GGAAACAGCG GAGGACAGCA AAGCCTGTTC ATCATTA7CA 


1080 


GTCGGAGTCT GCGTGAGGGG 


AGGGGTG7CA GTGAACACAC AGAGCA7CAG GAAGGAAGCC 


1140 


TTGAGGACAG 


ACGAACATCA 


AAGGGA7CCT GAGGACAACA GCTGGGAGCA GTTGGCA7CA 


1200 


C7GAG7GCCC 


7C7C7AAGTG 


7GGGGCC777 CrCTGCCACA 7AAA7GCAGG C7GGC7CC7C 


1260 


TCTCCACACA 


CTGGTCTCCA 


GCTCACCAAT CCTCCAGGTG ACTTCCAGCC ATGAAGACTC 


1320 


TTGTCCTCCT 


CTCTGCCCTT 


GTCCTGCTGG CATTCCAGAT CCAGGCTGA7 CCCAT7CAAG 


1380 


AGGCAGAAGA 


AGAGACTAAA 


ACTGAGGAGC AGCCAGCAGA TGAGGACCAG GA7GTGTCTG 


1440 


TCTCCTTTGA 


AGGCCCAGAA 


CCCTCTCCTC TTTAAAATTT AGGTGCGTGC TTGTGCACAG 

WLL 1 v» > UL 1 L 11 l«AA/W 1 1 ■ nUU \ UVU 1 UL 1 1 U 1 UW\W«U 


15Q0 


AATCATGGAG 


GCTTGGAGTC 


t r n cat r.r.A r.r.r.T t Ar.A TTAftrrrtnn agtcctgtpa 


1560 


ACGACAGTCT 


GGTTCAGGTA 


KPTr.TrTATT CATrrTTTTA (VAArTTfTfT HTCTTATTCA 

VJL lUlWIni 1 UA 1 LL 1 I 1 LA Un^L 1 1 tUU 1 Ull>l mi 1 V»/* 


1620 


TAGAAATAAC 


AGTGAGAGAC 


MuCLA 1 1 uu uL 1 1 UAL ill 11 LL lilt RM UA 1 1 1 I uu 1 V. 


1630 


TAACAATTTA 


TC7G7GAAAA 


irrTTTiiii TATAAAATAT A T Tft A T T AT. T TTTTTTAAAT 
ALU 1 1 • AAAA 1 A 1 AAAALA 1 AM UA 1 \ AU 1 1 L I • ' I AAAL 


1740 


C7GA77GATA A7777G77A7 


&rr i Af*A a a t A&rxc.TTrf x cttt aaaaaT taciatttcht 

AuuAAkiAAA 1 AAL 1 U 1 1 L 1 A L M 1 AAAAA t I AuA 1 1 1 uu \ 


T80G 


ACCTAAATGC 


CAGTGTATTA 


hrnrvjrrr TrirfiiiiP &.r aaT a lt(T Tf.AT AT a ATf. 
AbUl Ul 1 uuu ILauuAAAaL ALAAIaaIuL 1 UA 1 ALAA 1 L 


1860 


TGGT GTAGAT 


GACAA7TCCT 


iTITfrriTT iirirTTTTT AAATTPTTrT ATPTTrATftT 
AiAluLUAll AALALlluM AAAIlUlLLI AILlLLAIul 


1920 


— — GTTTCGGGTT 


GATCATGGTG 


L t (JUL 1 L • UA 1 U I l»ALLLAw ALAUUrArVU 1 lilt.' «W 


1980 


CATGCACAGG 


ACATCTTCAT 


Af.r.r.T AC.TTC ArTC.TC ACAC ACTGGTGGCC TCCTTACTTC 
AuuL i Au 1 1 L AL 1 U 1 LALAL AL 1 LL 1 LULL 1 UL l l Ak. I » w 


2040 


ATGCCTGATG CTTTCTCGTT 


Trn r Anr.t jrACAir.r.rr a tt.ca at. a r.r. tggcattgca 

I LL 1 LALvAuA 1 LA LA 1 LULL A 1 UUAAUAUL 1 UUL.A 1 1 ULn 


2100 


GAAGTTTCTG 


CAGACCTTAT 


GAAAATGCCA C7TCGT7CTG TGCTCAAGG7 C7ATT7AAAC 


2160 


AACACAAATT 


CTGCTGCCTA 


GAAACA7GGC CCCCAAGGAT GAAA7AACCA CGTGCTCTGG 


2220 


GACCTCACAA 


TCTGTCATCA 


TTG7GCT7GG CCTCAAC77C TTTTCCTTC7 CCAATAAAC7 


2280 


CCTTGCAGAC 


AAATAACCTG 


777A7G7777 77TGA7GCT7 TC7A7G7GGC T7AGACAGGG 


2340 


CTCTCCTGAG CCATGTAGCA 


GAA7CTTCAG 7GAA7CCTT7 GTAAAAGAAG TC77CG7CAC 


2400 


ATTTCAACAG 


TCATATCAAG 


GA7GAGCAGG AGGTTAGATC CAAAGAGACA AGA7GCTCTG 


2460 


CTCCAGCTGC 


TTCTTGACTA 


7CAAC7C7TC TG7CCTTCAG A7TAGAGTCA CCCICAAAAA 


2520 


rtAGTcccAC CTrrrcArcr 


7C7A7T77T7 7 


2551 
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WE CLAIM: 

1. A substantially purified cryptdin peptide 
of enteric origin having an amino acid sequence as 

follows: 

5 X 1 -C-X 2 -C-R-X 3 -C-X,<E-X S -C-X 6 -C-C-X 7 

wherein X x is 3 to 9 amino acids; 

X 2 is 1 amino acid; 

X 3 is 2 or 3 amino acids; 

X< is 3 amino acids; 

10 X 5 is 5 amino acids; 

X 6 is 6 to 10 amino acids; and 

X 7 is 0 to 9 amino acids. 

.2. A substantially purified mouse cryptdin 
15 peptide of enteric origin having an amino acid sequence 
as follows : 

X l -L-X 2 -C-Y-C-R«X 3 -C-K-X,-E-X 5 -G-T-C-X 6 -C-C-X 7 
wherein X : is 3 or 4 amino acids, preferably LRD, LSKK 
(SEQ ID NO: 8) or LRG; 
20 X 2 is 1 amino acid, preferably V, L or I; 

X 3 is 3 amino acids, preferably KGH or *RG, 
where * is S, T, K, I or A; 

X 4 is 2 amino acids, preferably GR, RR or RG; 

X 5 is 3 amino acids, preferably RMN, RVR, RVF 
2 5 HMN or HIN; 

X 6 is 6 to 9 amino acids, preferably . GIRFLY (SEQ 
ID NO: 3), RRGHLMYTL (SEQ ID NO: 59) or RtfLFLTFVF (SEQ ID 
NO. : 4) or RKGHL* YT* (SEQ ID NO: 5), where * independently 
is L or M; and ■ ' r 

30 X 7 is 0 to 3 amino acids, preferably R, S or 

PRR . 

3. The substantially purified cryptdin peptide 
of claim 2, wherein X> is selected from the .groups 
35 consisting of LRD , G and LSKK (SEQ ID NO: 8) . 
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4. The substantially purified cryptdin of 
claim 2, wherein X 2 is selected from the group consisting 
of V, L and I . 

S 5. The substantially " purified cryptdin' of 

claim 2, wherein X 3 is selected from the group consisting 
of KGH and *RG , wherein * is selected from the group 
consisting of S, T, K, I and A. 

10 -6. The substantially purified cryptdin of- 

claim 2, wherein X, is selected from the group consisting 
of GR, RR and RG. 

7. The substantially purified cryptdin of 
15 claim 2, wherein X s is selected from the group consisting 

of RMN, RVR, RVF, HKN and HIN . 

8. The substantially purified cryptdin of 
claim 2, wherein X 6 is selected from the group consisting 

20 of GIRFLY (SEQ ID NO: 3), RNLFLTFVF ■ (SEQ ID NO: 4), 
RRGHLMYTL (SEQ ID NO: 59) and RKGHL*YT* (SEQ ID NO: 5), 
wherein * indicates L or M independently. 

9. The substantially purified cryptdin of 
25 claim 2, wherein X 7 is selected from the group consisting 

of R, S and PRR. 

10. The substantially purified cryptdin of 
claim 2, wherein, the amino acid sequences X L , L , X 2 are 

30 absent. 
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11. A substantially purified cryptdin peptide 
' of enteric origin having an amino acid sequence selected 

from the group consisting of: 

5 GLLCYCRKGHCKRGERVRGTCG I RFL YCCPRR (SEQ ID NO: 15); 

LSKKLICYCRIRGCKRRERVFGTCRNLFLTFVFCCS (SEQ ID NO: 16); 
LRDLVCYCRARGCKGRERMNGTCRKGHLLYMLCCR (SEQ ID NO: 17); 
LKQCHCRKFCRPYEKAEGSCRPGLFIKRKICCIQQWTPGRT (SEQ ID 
NO: 18) ; 

10 IGRPVRRCRCRANCGPKEYATAFCAQGPFKQFKFCCT (SEQ ID NO: 19); 

IRWPWKRCHCRSFCRPYENATSFCAQGLFKQHKFCCLDTWPPRMK (SEQ ID 
NO: 20) ; 

TSGSQARATCYCRTGRCATRESLSGVCE I SGRL YRLCCR (SEQ ID 
NO: 21) ; and 

15 AFTCHCRRSCYSTEYSYGTCTVMGINHRFCCL (SEQ ID NO: 22). 

12. A pharmaceutical composition, comprising 
a cryptdin peptide and a physiologically acceptable 
carrier. 

20 

13. A method for detecting an inflammatory 
pathology in a subject, comprising the steps of: 

a. determining the amount of a cryptdin 
in a biological sample from the subject; and 
25 ■■ b . comparing said amount to the mean 

amount in a normal subject, wherein a 
significant deviation from said mean amount in 
a normal subject is indicative of an 
inflammatory pathology in said subject. 

30 

14. The method of claim 13, wherein the 
presence of said cryptdin is determined by contacting 
said biological sample with a detectable ant i -cryptdin 
antibody and detecting binding of said ant i -cryptdin 

35 antibody to said biological sample. 
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15. The method of claim 13, wherein said 
biological sample is selected from the group consisting 
of intestinal tissue and the contents of the intestinal 
lumen , 

5 

IS. The method of claim 13, wherein said, 
inflammatory pathology is selected from the group 
consisting of inflammatory bowel disease, pancreatitis, 
malignancy, infection and ileitis. 

10 

17. A method for treating inflammation of the 
intestine in a subject, comprising administering a 
cryptdin peptide to the subject, 

15 18. The method of claim 17, wherein' said 

subject is immunocompromised. 

19. The method of claim 18, wherein said 
subject is immunocompromised due to malignancy, 

20 malnutrition, radiation burns, immunosuppressive 
infections, autoimmune disease or neonatality, bone 
marrow transplantation or chemotherapy. 

20. 'The method of claim 17, wherein said 
25 cryptdin is administered by a means selected from the 

group consisting of oral administration, nasogastric 
intubation, transabdominal 'catheterization, intravenous 
administration, aerosol inhalation and topical 
administration. 

30 

21. The method of claim 17, wherein more than 
one cryptdin is administered ' simultaneously or 
sequentially. 
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22. The method of claim 17, wherein said 
cryptdin is administered orally in a delayed release 
formulation designed to permit release in the small 
intestine. , 

5 

23. An anti-cryptdin antibody. 

24. The anti-cryptdin antibody 'of claim 23, 
wherein said antibody is a monoclonal antibody. 

10 

25. In a method for chemically synthesizing a 
peptide* by attaching an amino acid 'to a resin, 
sequentially coupling additional amino acids to obtain a 
protected peptide resin, cleaving the protected peptide 

15 from the resin and deprotecting the peptide, the 
improvement comprising prior to cleavage ~ and 
deprotection, reswelling the protected peptide resin with 
dichloromethane . 

20 26. The method of claim 25, wherein said 

peptide is a cryptdin. 

27. A method for preventing inflammation in a 
subject as a result of surgery, comprising administering 

25 a .cryptdin' to said subject prior to said surgery. 

28. A substantially purified nucleic acid 
molecule encoding a cryptdin. 

30 29. The nucleic acid molecule of claim 28, 

wherein said nucleic acid molecule is a cryptdin gene or 
a portion thereof. 

30. A nucleic acid molecule, comprising a gene 
35 selected from the group consisting o£ the mouse cryptdins 
1, 2, 3, 5, 6 and i genes as shown in Figure 11 (SEQ ID 
NOS : 5 3 - 5 8 ) . 
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31. A nucleic acid molecule, comprising a gene 
selected from the group consisting of the rat cryptdins 
1, 2 and 3 genes as shown in Figures 15. A. to 15. C. (SEQ 
ID NOS: 66-68) . 

5 

32. The nucleic acid molecule, of claim 28, 
wherein said nucleic acid molecule is a cryptdin cDNA - 
sequence or a portion thereof. 

10 3 3. A nucleic acid molecule, comprising a cDNA 

sequence selected from the group consisting of the mouse 
cryptdins 1-17 cDNA sequences as shown in ' Figure 10 {SEQ 
ID NOS: 34 -50) . 

15 34. A nucleic acid molecule, comprising a cDNA 

sequence selected from the group consisting o.f_ the mouse 
cryptdins 2-17 cDNA sequences as. shown in Figure 10 (SEQ 
ID NOS: 35-50) . 

20 3 5. A nucleic acid molecule, comprising a cDNA 

sequence selected from the group consisting of the rat 
cryptdins 1, 2 and 3 cDNA sequences as shown in Figures 
14. A. to 14. C. (SEQ ID NOS : 63 -65). 

25 36. A nucleotide sequence that can hybridize 

under relatively stringent conditions to a nucleic acid 
molecule encoding a cryptdin. 

37. The nucleotide sequence of claim 36, 
.30 comprising a portion of a nucleic acid molecule selected 
from the group consisting of the mouse cryptdins 2-17 
cDNA sequences as shown in Figure 10 (SEQ ID NOS: 34-50); 
the mouse cryptdins 1, 2, 3, 5, 6 and i genes as shown in 
Figure 11 (SEQ ID NOS : 53 -58); the rat cryptdins 1, 2 and 
35 3 genes .as shown in Figures 15. A. to 15, C. (SEQ ID NOS : 
66 -68) and the rat cryptdins 1, 2 and 3 cDNA sequences as 
shown in Figures 14. A. to 14. C. (SEQ ID NOS : 63-65). 
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38. A method of detecting the presence of a 
nucleic acid molecule encoding a cryptdin in a biological 
sample, comprising the steps of: 

a. contacting the biological sample with 
the • nucleotide sequence of claim 35 under 
relatively stringent hybridization conditions; 
and 

b. detecting hybridization of said 
nucleotide sequence to a nucleic acid molecule 
present in said sample, wherein said 
hybridization indicates the presence of a 
•.nucleic acid molecule encoding a cryptdin. 
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FIG. 12 A 



FIG. 12B 



FIG. 12C 
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FIG, 13A 



FIG. 13B 



FIG. 13C 
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RAT CRYPTIN 1 cDNA SEQUENCE 



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

ACACTGGTCT CCAGCTCACC AATCCTCCAG GTGACTTCCA GCCATGAAGA 50 

CTCTTGTCCT CCTCTCTGCC CTTGTCCTGC TGGCATTCCA GGTCCAGGCT 100 

GATCCCATTC AAGAGGCAGA AGAAGAGACT AAAACTGAGG AGCAGCCAGC 150 

AGATGAGGAC CAGGATGTGT CTGTCTCCTT TGAAGGCCCA GAACCCTCTG 200 

CTCTTCAAAA TTTAGAGATA GGATGGCCAT TAAAGCAGTG CCATTGCCGA 250 

AAGTTCTGCA GACCTTATGA AAAGGCCGAG GGGTCCTGTC GTCCAGGTCT 300 

ATTTATAAM CGCAAAATCT GCTGCATACA ACAATGGACA CCAGGGAGGA 350 

. CATAACCACG TGAACTGGGA CCTCACAATC TGTCATTCTT GGGCTTCAAC 400 

TCGACTGCTT TTCCTTCTCC AATAAACCCC TTGCAGACAA AAAAA 445 

FIG. 14B 

RAT CRYPTIN 2 cDNA SEQUENCE 

10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

ACACTGGTCT CCAGCTCACC AATCCTCCAG GTGACTTCCA GCCATGAAGA 50 

CTCTTGTCCT CCTCTCTGCC CTTGTCCTGG TGGCCTACCA GGTCCAGGCT 100 

GATCCCATTC AAGGGGCAGA AGAAGAGACT AAAACTGAAG AGCAACCATC 150 

AGATGAGGAC CAGGATGTGT CTGTCTCCTT TGAAGGCCCA GAAGCCTCTG 200 

CTCTTCAAGA TTTTGAGATA GGAAGGCCAG TGAGGAGGTG CCGTTGCAGA 250 

GCAAACTGCG GACCTAAAGA ATATGCCACT GCGTTCTGTG CTCAAGGTCC 300 

ATTTAAACAG TTCAAATTCT GCTGCACATG AACATGGATC CCAAGTCTGA 350 

GATAACCACG TGCTCTGGGA CCTCACAATC TGTCATTATT GTGCTTGACC 400 

TCAACTGCTT TTCCTTCTCC AATAAACTCC TGGCAGACAA AAAAA 445 



FIG. 14C 

' RAT CRYPTIN 3 cDNA SEQUENCE 

10 20 ~30 40 ' 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

ACACTGGTCT CCAGCTCACC AATCCTCCAG GTGACTTCCA GCCATGAAGA 50 

CTCTTGTCCT CCTCTCTGCC CTTGTCCTGC TGGCATTCCA GATCCAGGCT 100 

GATCCCATTC AAGAGGCAGA AGAAGAGACT AAAACTGAGG AGCAGCCAGC 150 

AGATGAGGAC CAGGATGTGT CTGTCTCCTT TGAAGGCCCA GAACCCTCTG ' 200 

CTCTTCAAAA TTTAGAGATC AGATGGCCAT GGAAGAGGTG CCATTGCAGA 250 

AGTTTCTGCA GACCTTATGA AAATGCCACT TCGTTCTGTG CTCAAGGTCT 300 

ATTTAAACAA CACAAATTCT ' GCTGCCTAGA AACATGGCCC CCAAGGATGA 350 

AATAACCACG TGCTCTGGGA CCTCACAATC TGTCATCATT GTGCTTGGCC 400 

TCAACTTCTT TTCCTTCTCC AATAAACTCC TTGCAGACAA AAAAA 445 
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FIG. 15A(1) 

RAT CRYPTIN 1 GENOMIC SEQUENCE 

10 20 30 . 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

CCTGAGACCA ACTCTGTGAT AATCAGAAAA GTCAATAATG TGTCTGAAAT 50 

GTAAGGTGTG CTTCTTGACT GATAGTTCTA AGCCTACAGA GAGATTCATG 100 

TGGTCATATC CCATTTAACA ATGATATATA TGTTAAATAT ATAAAGATAT 150 

ATGTATGTTC AGTATGTATG TTCAATATGT ATGTAAATAA TATTCTTGCT 200 

GCTTCACTAG CTTTTACACA GAGCTGTAAG TAAAAACATT GTAGCCAATG 250 

AATAGTATTT ATTAACATGT AAATAGGAGC TGGCACCTGT GACAGTGGGA 300 

CTCCATACAC TGACTGTAAA CAACAGGATG CTCTGGACCT TTTGCTGTGT 350 

GTGTGGTGAG AGACATGGGA TAAACACAGA CTGAAGAGTG TTCCTGAATG 400 

ACATGGCGGC ACTTCTCGAG ACCGGGTAGC AGCTTCTGAG CCTCTCTACA 450 

TTGTGGATGT CCTTTCCTGT AGGTCAGGTC TCATTGTCTA AAAGTAAAAG 500 

CATTGCAGCA TCTCAGACCT GGGAAACACC CCATGGCTTG AGGGTCCTGA 550 

GCATGAAGAG CCACCTGGAG CTCACTCTTG GCAGATGTGT TCCATGACTT 600 

TGGCTTCTTC AGAACAACCC ACTACAGCTT CACTCTGACA AATCCTAGAA 650 - 

ACTTGAACTC AATTCACTAG AGGGCACCAT AAAGCCATCA TACCTTATAA 700 

TGGCCCCAAA GGAGGTGATT CACAAAGTTT GCCTTGATGA GGACAATTGC 750 

TAATACACAA AAACTTGCAA AAAAAAATTG AGTGTCCAGT CCACCTGGTC 800 

AAGGACTGGT CCCGGATCCA CAGTTTCTGA GAATAGCAGG CTCTAACTTG 850 

AAAACACAAA AATTGTTTGT TCTATGAGCT CATTAAATTA GGCAGTGTTC 900 

AGCTATTTTC TTTCCTGACC ACTGAGAGGT AAATACTCAA GCAGATGGGA 950 

AACAGGGGAG GACAGTAAAG CCTGTTCATC ATTATCAGTG GGAGTGTGCA 1000 

TGAGGGGAGG GGTGTCAGTG AACACACAGA GCATCAGGAA GGAAGCCTTG 1050 

AGGACAGAGG AACATCAAAG GGATCCTGAG GACAACAGCT GGGAGCAGTT 1100 

GCCATCAATG AGTGCCTTCT CTAAGTATGG GGCATGTTCT TTGCCCTATA 1150 

AATGCAGGCT GGCTTCTCTC TCCACACACT GGTCTCCAGC TCACCAATCC 1200 

TCCAGGTGAC TTCCAGCCAT GAAGACTCTT GTCCTCCTCT CTGCCCTTGT 1250 

CCTGCTGGCA TTCCAGGTCC AGGCTGATGC CATTCAAGAG GCAGAAGAAG 1300 

AGACTAAAAC TGAGGAGCAG CCAGCAGATG AGGACCAGGA TGTGTCTGTC 1350 

TCCTTTGAAG GCCCAGAACC CTCTGCTCTT CAAAATTTAG GTGCGTGCTT 1400 

GTGCACAGAA TGATGGAGGC TTGGAGTCTC CTGATGGAGG GTTGTAGATT 1450 

AGCCCTGGAG TCCTGTCAAG GACAGTCTGG TTCAGGTAGC TGTCTACTGA 1500 

, TCCTTTCAGA ACTTCCCTGT CTTATTCATA GAAATAACAG TGAGAGACAA 1550 

GCCATTGGGC TTGACTTTTT CCTTTTAAGA TTTCGGTCTA ACAATTTATC 1600 

TGTGAAAAAC CTTTAAAATA TAAAACATAT TGATTAGTTC TTrAAACCTG 1650 

AGTGATAATT TTCTTACAGG AAGAAATATC CGTTTTACCC TAAAAATTAG 1700 

ATTGGTACCC AAATGCCAGT GTATGAAGGT GTTGGGTCAA GAAAACACAA 1750 

AAAAACTGTT AGAATATGGT GTAGATGAAA ATTCCTATAT GTGATTAACA " 1800 

CTTGTTAAAC ATCTTATCTC CATGTGTTTG GGGTTGATCA CTGTGCTGGC 1850 ■. 

TGTGATGTCA CCCACACAGC AAACCTACTC TCTACCATGC ACAGGACATC 1900 
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FIG. 15A(2) 



RAT CRYPTIN 1 GENOMIC SEQUENCE 

~1Q 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

TTCATGGGGT AGTTCACTGT TACACACTAC TGGCCTCCTT ACTTCATGCC 1950 

TGATGCTTTC TTGTTTCCTC AGAGATAGGA TGGCCATTAA AGCAGTGCCA 2000. 

TTGCCGAAAG TTCTGCAGAC CTTATGAAAA GGCCGAGGGG TCCTGTCGTC 2050 

CAGGTCTATT TATAAAACGC AAAATCTGCT GCATACAACA ATGGACACCA 2100 

GGGAGGACAT AACCACGTGA ACTGGGACCT CACAATCTGT CATTCTTGGG 2150 

CTTCAACTCG ACTGCTTTTC CTTCTCCAAT AAACCCCTTG CAGACAAATA 2200 

ACCTGTTTAT GTTTTTTTGA TGCTTTCTAT GTGGCGTAGA CAGGACTCTC 2250 

CTGAGCCATG TAGCAAAATC TTCAGTGAAT CCTTTGTAAA AGAAGTCTTG 2300 

GTCACATTTC AGCAGTCATA TCAAGGATGA GCAGGAGGTT AGATCCAAAG 2350 

AGACAAGATG GTCTGCGCCA GCTGCTTCTG TGTCTATCAA GTCTTCTGTC 2400 

CTTTAGATTA GAGTCACCCT CAAAAATTAG TTCCAGATTT TCATGTTCTA 2450 

TTTTTTC 2457 
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FIG. J 58(1 J 

■ RAT CRYPTIN 2 GENOMIC SEQUENCE 

~~ 10 20 30 40 50 



1234567890 1234567890 1234567890 1234567890 1234567890 



TATTACGMT 


TCGAGCTCGG 


TACCGGTATA 


TGAAGAGCGA 


CCACTGCCAG 


50 


GACGAAAGTG 


CAATGCGGCA 


TACCTCAGTG 


GCGTGGAGTG 


CAGGTATACA 


100 


GATTAATCCG 


GCAGCGTCCG 


TCGTTGTTGA 


TATTGCTTAT 


GAAGGCTCCG 


150 


GCAGTGGCGA 


CTGGCGTACT 


GACGGATTCA 


TCGTTGGGGT 


CGGTTATAAA 


200 


TTCTGATTAG 


CCAGGT AACA 


CAGTGTTATG 


ACAGCCCGCC 


GGAACCGGTG 


250 


GGCIM IIIG 


TGGGGTGAAT 


ATGGCAGTAA 


AGATTTCAGG 


AGTCCTGAAA 


300 


GACGGCACAG 


GAAAACCGGT 


ACAGAACTGC 


ACCATTCAGC 


TGAAAGCCAG 


350 


ACGTAACAGC 


ACCACGGTGG 


TGGTGAACAC 


GGTGGGCTCA 


GAGAATCCGG 


400 


ATGAAGCCTG 


UIIIIIATA 


CTAAGTTGGC 


ATTATAAAAA 


AGCATTGCTT 


450 


ATCAATTTGT 


TGCAACGAAC - 


AGGTCACTAT 


CAGTCAAAAT 


AAAATCATTA 


500 


TTTGATTTCA 


ATTTTGTCCC 


ACTCCCTGCC 


TCTGTCATCA 


CGATACTGTG 


550 


ATGCCATGGT 


GTCCGACTTA 


TGCCCGAGAA 


GATGTTGAGC 


AAACTTATCG 


600 


CTTATCTGCT 


TCTCATAGAG 


TCTTGCAGAC 


AAACTGCGCA 


ACTCGTGAAA 


650 


GGTAGGCGGA 


TCTGGGTCGA 


CTCTAGGCCT 


CACTGGCCTA 


ATACGACTCA 


700 


CTATAGGGAG 


CTCGAGGATC 


ATTGCTAATA 


CCATGAAACT 


TGACCACCTG 


750 


GTCAAGGACT 


GGTCCAGGGT 


CCACAGtTTC 


TGAGAAGAGC 


AGGCTCCAAC 


800 


TTCTAACCAC 


AAAAACTATT 


TTTTCCATGC 


GCTCCTTAAA 


TTAGGCAGCG 


850 


CCCAGCTATT 


TTCTTTCCTG 


ACCACTGAGA 


GGTAMTACT 


CAAGCAGATG 


900 


GGAAACAGGG 


GAAGATAGCA 


AGGCCTCTTC 


ATCATTATCA 


CTGGGTGTGT 


950 


GCGTGAGGGG 


AGGGGTGTCA 


TTGCATACAC 


AGGGCAACAT 


CAGGATGGAA 


1000 


GCCTTGAGGA 


CAGAGGAACA 


TCAAAGGGAT 


CCTGAGGACA 


ACAGCTGGGA 


1050 


GCAGTTGCCA 


TCAGTGAGTG 


CCTTCTCTAA 


GTGTGGGGCC 


TTTCTCTGCC 


1100 


ACATAAATGC 


AGGCTGCCTC 


CTCTCTCCAC 


ACACTGGTCT. 


CCAGCTCACC 


1150 


AATCCTCCAG 


GTGACTTCCA 


GCCATGAAGA 


CTCTTGTCCT 


CCTCTCTGCC 


1200 


CTTGTCCTGG 


TGGCCTACCA 


GGTCCAGGCT 


GATCCCATTC 


AAGGGGCAGA 


1250 


AGAAGAGACT 


AAAACTGAAG 


AGCAACCATC 


AGATGAGGAC 


CAGGATGTGT 


1300 


CTGTCTCCTT 


TGAAGGCCCA 


GAAGCCTCTG 


CTCTTCAAGA 


TTTTGGTGAG 


1350 


TGCTTATGCA 


CAGAATGATG 


GAGGCTTGGA 


GTCTCCTGAT 


GGAGGGTTGT 


1400 


AGATTAGACC 


TGGAATCCTG 


TCAAGAACTG 


TCTGGTTCAG 


GTAGCTGTCT 


1450 


CTTGGTCCCT 


TTACATTCCT 


TGTCTTCTTC 


ATAGAAGTAA 


CGGAGAGAGA 


1500 


TTAACCATTG 


GGCTTGACTT 


TTTTCCTTTT 


AAAAMIIIG 


ATCTAACAAT 


1550 


TTATCTGTGG 


AAAACCTTTA 


AAATATAAAA 


CATATTGATT 


AGTTCTTTTA 


1600 


GACCT6ATTG 


ATAA1 1 1 IGT 


TATAAGAAGA 


AATATTCGTT 


CTACTTTAAA 


1650 


AATTAGATTT 


GGGACCCAAA 


TGCCAGTGTA 


TGAAGCTGTT 


GGGTAAGGAA 


1700 


AAACCAAAAA 


TGGTGATAGA 


ATGTTGTGTA 


GATGACAATT 


CCTTTATGCG 


1750 


ATTAACACTT 


TTTAAAATGT 


CTTATCTCCA 


TGTGTTTGGG 


GTTGATCATG 


1800 
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FIG. 15B(2) 



RAT CRYPTIN 2 GENOMIC SEQUENCE 

10 20 30 40 ' 15 

1234567890 1234567890 1234567890 1234567890 1234567890 

GTGCTGACTG TGATGTCACC CACAGAGCAA ACCTACTCTC TACCATGCAC 1850_ 

AGGACATCTT CATAGGGTAG TTCACTGTCA CACACTGCTG GCCTCGTTAC 1900 

TTCATGCCTG ATGCTTTCTT GTTTCCTCAG AGATAGGAAG GCCAGTGAGG 1950 

AGGTGCCGTT GCAGAGCAAA CTGCGGACCT AAAGAATATG CCACTGCGTT 2000 

CTGTGCTCAA GGTCCATTTA AACAGTTCAA ATTCTGCTGC ACATGAACAT 2050 

GGATCCCAAG TCTGAGATAA CCACGTGCTC TGGGACCTCA CAATCTGTCA 2100 

TTATTGTGCT TGACCTCAAC TGCTTTTCCT TCTCCAATAA ACTCCTGGCA 2150 

GACAAATAAT CGGTATATGT TTATTTGATG CTTTCTATTT GGCTTAGACA 2200 

GAACTCTCCT GAGCCATGTA GCTGAATCTT CAGTGAATCC TTTGTAAAGG 2250 

TCACATTTCA GCAGTCATAT CAAGGAT6AG CAGGAGGTTA GATACAAAGA 2300 

GACAAGATGG JCTGCGCCAG CTGCTTCTTT GTCTATCAAG TCTGCTTTCC 2350 

TTTAGATTAG AGTCACCATC AAAAATTATT CCCACATTTT CATGTTCTAT 2400 

AIM llll 2408 
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FIG. 15C(1) 

RAT CRYPTIN 3 GENOMIC SEQUENCE 

10 [ 20 30~ 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

CCTGAGACCA ACTCTGTGAT AATCA6AAAA GACAATTATG TGTCTTAAAT 50 

GTAAGGTTTG CTTCTTGACT GATAGATCTA ACCCTACAGA GAGATTCAAG 100 

TGGTCTTGTC CCATTGAACA ATAGTATATA TGTTTTATAT ATATATATAT 150 

ATATATGTAT ATGTATATAT ATATGTGTGT GTGTGTGTGT GTGTGTCTGT 200 

GTCTGTGTGT CTGTGTGTCT GTGTGTCTGT GTGTCTGTGT GTGTATGTGT 250 

GTGTATGTGT ACATATGTTC AATATGTCTG TAAAATAGTA TTCTTGTAGC 300 

TTCACTTACT TTTGCACAGA GCTGTAAATA AGAACATTGT AGCCAATGAA 350 

TAGTATTTAT TAACATGTAA ATAGGAGCTG GCACCTCTGA CAGTGGGACT 400 

CCATACAGTG ACTGTAAACA ACAGGATGCT CTAGACCTTT TGCTGTGTGT 450 

GTGGTGAGAG ACATGGGATA AACACAGACT GAAGTGTATG ACATGGCGGC 500 

ACTTCTCGAG ACCGGGTAGC AGCTTCTGAG CCTCTCTACA TTGTGGATGT 550 

CCTTTCCTGT AGGTCAGGTC TCATTGTCTA AAAGTAAAAG CATTGCAGCA 600 

TCTCAGACCT GGGAAACACC CCATGGCTTG AGGGTCCCGC AGGTGAAGAG 650 - 

CCACCTGGAG CTCACTCTTG GCAGATGTGT TCCATGACTT TGGCTTCTTC 700 

AGAACCACCC ACTACAGCTT CACTCTGACA AATCTTAGAA ACTTGAACTC 750 

AATTCACTGG AGGGCACAAT AAAGCCATCT TACTTTCTCT AAAATGGCCC 800 

CAAAGGAGGG GATTCACAAA GTTTGCCTTG ATGAGGACCA TTGCTAATAC 850 

CCCAAAACTT GCAAAAAAAA TTGAGTGTCC AGTCAACCTG GTCAAGGACT 900 

GGTCCTGGAT CCACAGTTTC TGAGAAAAGA AGGCTCCAAC TTCAAAACAC 950 

AAACCACTCC TGTTCTATGC GCTCATTAAA TTAGGCAGTG TTAAGCTATT 1000 

TTCTTTCCTG ACCACTGAGA GGTAAATACT CAAGCAGATG GGAAACAGGG 1050 

GAGGACAGCA AAGCCTGTTC ATCATTATCA GTGGGAGTGT GCGTGAGGGG 1100 

AGGGGTGTCA GTGAACACAC AGAGCATCAG GAAGGAAGCC TTGAGGACAG 1150 

AGGAACATCA AAGGGATCCT GAGGACAACA GCTGGGAGCA GTTGGCATCA 1200 

GTGAGTGCCG TCTCTAAGTG TGGGGCCTTT CTCTGCCACA TAAATGCAGG 1250 

CTGGCTCCTC TCTCCACACA CTGGTCTCCA GCTCACCAAT CCTCCAGGTG 1300 

ACTTCCAGCC ATGAAGACTC TTGTCCTCCT CTCTGCCCTT GTCCTGCTGG 1350 

CATTCCAGAT CCAGGCTGAT CCCATTCAAG AGGCAGAAGA AGAGACTAAA 1400 

ACTGAGGAGC AGCCAGCAGA TGAGGACCAG GATGTGTCTG TCTCCTTTGA 1450 

AGGCCCAGAA CCCTCTGCTC TTCAAAATTT 'AGGTGCGTGC TTGTGCACAG 1500 

AATGATGGAG GCTTGGAGTC TCCTGATGGA GGGTTGTAGA TTAGCCCTGG 1550 

AGTCCTGTCA AGGACAGTCT GGTTCAGGTA GCTGTCTATT GATCCTTTCA 1600 

GAACTTCCCT GTCTTATTCA TAGAAATAAC AGTGAGAGAC AAGCCATTGG 1650 

GCTTGACTTT TTCCTTTTAA GATTTTGGTC TAACAATTTA TCTGTGAAAA 1700 

ACCTTTAAAA TATAAAACAT ATTGATTAGT TCTTTTAAAC CTGATTGATA 1750 

ATTTTGTTAT AGGAAGAAAT AACTGTTCTA CTTTAAAAAT TAGATTTGGT 1800 

ACCTAAATGC CAGTGTATTA AGGTGTTGGG TCAGGAAAAC ACAATAATGC 1850 

TGATAGAATG TGGTGTAGAT GACAATTCCT ATATGCGATT AACACTTGTT 1900 
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FIG. 15C(2) 



RAT CRYPTIN 3 GENOMIC SEQUENCE 

10 . 20 30 ' 4CT" ~~ 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

AMTTGTCCT ATCTCCATGT GTTTGGGGTT GATCATGGTG CTGGCTGTGA 1950 

TGTCACCCAC ACAGCAAACC TACTTTCTAC CATGCACAGG ACATCTTCAT 2000 

AGGGTAGTTC ACTGTCACAC ACTGCTGGCC TCCTTACTTC ATGCCTGATG 2050_ 

CTTTCTCGTT TCCTCAGAGA TCAGATGGCC ATGGAAGAGG TGCCATTGCA 2100 

GAAGTTTCTG CAGACCTTAT GAAAATGCCA CTTCGTTCTG TGCTCAAGGT 2150 

CTATTTAAAC AACACAAATT CTGCTGCCTA GAAACATGGC CCCCAAGGAT 2200 

GAAATAACCA CGTGCTCTGG GACCTCACAA TCTGTCATCA TTGTGCTTGG 2250 

CCTCAACTTC TTTTCCTTCT CCAATAAACT CCTTGCAGAC AAATAACCTG 2300 

TTTATGTTTT TTTGATGCTT TCTATGTGGC TTAGACAGGG CTCTCCTGAG 2350 

CCATGTAGCA GAATCTTCAG TGAATCCTTT GTAAAAGAAG TCTTGGTCAC 2400 

ATTTCAACAG TCATATCAAG GATGAGCAGG AGGTTAGATC CAAAGAGACA 2450 

AGATGCTCTG CTCCAGCTGC TTCTTGACTA TCAAGTCTTC TGTCCTTCAG 2500 

ATTAGAGTCA CCCTCAAAAA TTAGTCCCAC CTTTTCATGT TCTAI 1 1 1 1 1 2550 

T 2551 
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